HYBRID ADENOVIRUS-AAV VIRUS AND METHODS OF USE THEREOF

This invention was supported by the National
Institutes of Health Grant No. P30 DK 47757. The United
States government has rights in this invention.

Field of the Invention

The present invention relates to the field of
vectors useful in somatic gene therapy and the production
thereof.



Background of the Invention 

Recombinant adenoviruses are capable of
providing extremely high levels of transgene delivery to
virtually all cell types, regardless of the mitotic
state. High titers (10^13 plaque forming units/ml) of
recombinant virus can be easily generated in 293 cells
(the adenovirus equivalent to retrovirus packaging cell
lines) and cryo-stored for extended periods without
appreciable losses.

The primary limitation of this virus as a
vector resides in the complexity of the adenovirus
genome. A human adenovirus is comprised of a linear,
approximately 36 kb double-stranded DNA genome, which is
divided into 100 map units (m.u.), each of which is 360
bp in length. The DNA contains short inverted terminal
repeats (ITR) at each end of the genome that are required
for viral DNA replication. The gene products are
organized into early (E1 through E4) and late (L1 through
L5) regions, based on expression before or after the
initiation of viral DNA synthesis [see, e.g., Horwitz,
Virology, 2d edit., ed. B. N. Fields, Raven Press, Ltd.,
New York (1990)].
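
The map-unit convention described above can be illustrated with a
short sketch. It assumes the approximate figures quoted in the text
(a 36 kb genome divided into 100 map units); the function names are
hypothetical, and actual adenovirus genomes differ slightly in length.

```python
# Illustrative sketch only. GENOME_BP and MAP_UNITS are the approximate
# values quoted in the text, not measured genome lengths.
GENOME_BP = 36_000
MAP_UNITS = 100
BP_PER_MU = GENOME_BP / MAP_UNITS  # 360 bp per map unit


def mu_to_bp(mu: float) -> int:
    """Convert a map-unit position to an approximate base-pair position."""
    return round(mu * BP_PER_MU)


def bp_to_mu(bp: int) -> float:
    """Convert a base-pair position to an approximate map-unit position."""
    return bp / BP_PER_MU


if __name__ == "__main__":
    # Map unit 1 corresponds to roughly bp 360 under these assumptions.
    print(mu_to_bp(1), bp_to_mu(360))
```

Under these assumptions, a position given in map units is simply
scaled by 360 to obtain an approximate base-pair coordinate.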

A human adenovirus undergoes a highly regulated
program during its normal viral life cycle [Y. Yang et
al, Proc. Natl. Acad. Sci., USA, 91:4407-4411 (1994)].



WO 96/13598 PCT/US95/14018

Virions are internalized by receptor-mediated endocytosis
and transported to the nucleus where the immediate early
genes, E1a and E1b, are expressed. Because these early
gene products regulate expression of a variety of host
genes (which prime the cell for virus production) and are
central to the cascade activation of early delayed genes
(e.g. E2, E3, and E4) followed by late genes (e.g. L1-5),
first generation recombinant adenoviruses for gene
therapy focused on the removal of the E1 domain. This
strategy was successful in rendering the vectors
replication defective; however, in vivo studies revealed
that transgene expression was transient and invariably
associated with the development of severe inflammation at
the site of vector targeting [S. Ishibashi et al, J.
Clin. Invest., 93:1885-1893 (1994); J. M. Wilson et al,
Proc. Natl. Acad. Sci., USA, 85:4421-4424 (1988); J. M.
Wilson et al, Clin. Bio., 3:21-26 (1991); M. Grossman et
al, Som. Cell. and Mol. Gen., 17:601-607 (1991)].

Adeno-associated viruses (AAV) have also been
employed as vectors. AAV is a small, single-stranded
(ss) DNA virus with a simple genomic organization (4.7
kb) that makes it an ideal substrate for genetic
engineering. Two open reading frames encode a series of
rep and cap polypeptides. Rep polypeptides (rep78,
rep68, rep52 and rep40) are involved in replication,
rescue and integration of the AAV genome. The cap
proteins (VP1, VP2 and VP3) form the virion capsid.
Flanking the rep and cap open reading frames at the 5'
and 3' ends are 145 bp inverted terminal repeats (ITRs),
the first 125 bp of which are capable of forming Y- or
T-shaped duplex structures. Of importance for the
development of AAV vectors, the entire rep and cap
domains can be excised and replaced with a therapeutic or
reporter transgene [B. J. Carter, in "Handbook of
Parvoviruses", ed., P. Tijssen, CRC Press, pp. 155-168
(1990)]. It has been shown that the ITRs represent the
minimal sequence required for replication, rescue,
packaging, and integration of the AAV genome.

The AAV life cycle is biphasic, composed of
both latent and lytic episodes. During a latent
infection, AAV virions enter a cell as an encapsidated
ssDNA, and shortly thereafter are delivered to the
nucleus where the AAV DNA stably integrates into a host
chromosome without the apparent need for host cell
division. In the absence of helper virus, the integrated
ss DNA AAV genome remains latent but capable of being
activated and rescued. The lytic phase of the life cycle
begins when a cell harboring an AAV provirus is
challenged with a secondary infection by a herpesvirus or
adenovirus which encodes helper functions that are
recruited by AAV to aid in its excision from host
chromatin [B. J. Carter, cited above]. The infecting
parental ssDNA is expanded to duplex replicating form
(RF) DNAs in a rep dependent manner. The rescued AAV
genomes are packaged into preformed protein capsids
(icosahedral symmetry approximately 20 nm in diameter)
and released as infectious virions that have packaged
either + or - ss DNA genomes following cell lysis.

Progress towards establishing AAV as a
transducing vector for gene therapy has been slow for a
variety of reasons. While the ability of AAV to
integrate in quiescent cells is important in terms of
long term expression of a potential transducing gene, the
tendency of the integrated provirus to preferentially
target only specific sites in chromosome 19 reduces its
usefulness. Additionally, difficulties surround
large-scale production of replication defective recombinants.
In contrast to the production of recombinant retrovirus
or adenovirus, the only widely recognized means for
manufacturing transducing AAV virions entails
co-transfection with two different, yet complementing
plasmids. One of these contains the therapeutic or
reporter minigene sandwiched between the two cis acting
AAV ITRs. The AAV components that are needed for rescue
and subsequent packaging of progeny recombinant genomes
are provided in trans by a second plasmid encoding the
viral open reading frames for rep and cap proteins. The
cells targeted for transfection must also be infected
with adenovirus, thus providing the necessary helper
functions. Because the yield of recombinant AAV is
dependent on the number of cells that are transfected
with the cis and trans-acting plasmids, it is desirable
to use a transfection protocol with high efficiency. For
large-scale production of high titer virus, however,
previously employed high efficiency calcium phosphate and
liposome systems are cumbersome and subject to
inconsistencies.

There remains a need in the art for the
development of vectors which overcome the disadvantages
of the known vector systems.

Summary of the Invention

In one aspect, the present invention provides a
unique recombinant hybrid adenovirus/AAV virus, which
comprises an adenovirus capsid containing selected
portions of an adenovirus sequence, 5' and 3' AAV ITR
sequences which flank a selected transgene under the
control of a selected promoter and other conventional
vector regulatory components. This hybrid virus is
characterized by high titer transgene delivery to a host
cell and the ability to stably integrate the transgene
into the host cell chromosome in the presence of the rep
gene. In one embodiment, the transgene is a reporter
gene. Another embodiment of the hybrid virus contains a
therapeutic transgene. In a preferred embodiment, the
hybrid virus has associated therewith a polycation
sequence and the AAV rep gene. This construct is termed
the hybrid virus conjugate or trans-infection particle.

In another aspect, the present invention
provides a hybrid vector construct for use in producing
the hybrid virus or viral particle described above. This
hybrid vector comprises selected portions of an
adenovirus sequence, 5' and 3' AAV ITR sequences which
flank a selected transgene under the control of a
selected promoter and other conventional vector
regulatory components.

In another aspect, the invention provides a
composition comprising a hybrid viral particle for use in
delivering a selected gene to a host cell. Such a
composition may be employed to deliver a therapeutic gene
to a targeted host cell to treat or correct a genetically
associated disorder or disease.

In yet another aspect, the present invention
provides a method for producing the hybrid virus by
transfecting a suitable packaging cell line with the
hybrid vector construct of this invention. In another
embodiment the method involves co-transfecting a cell
line (either a packaging cell line or a non-packaging
cell line) with a hybrid vector construct and a suitable
helper virus.

In a further aspect, the present invention
provides a method for producing large quantities of
recombinant AAV particles with high efficiency by
employing the above methods, employing the hybrid vector
construct of this invention and collecting the rAAV
particles from a packaging cell line transfected with the
vector.

Other aspects and advantages of the present
invention are described further in the following detailed
description of the preferred embodiments thereof.

Brief Description of the Drawings

Fig. 1A is a schematic diagram of a vector
construct pAd.AV.CMVLacZ [SEQ ID NO: 1], which contains
(from the top in clockwise order) adenovirus sequence map
units 0-1 (clear bar); the 5' AAV ITR (solid bar); a CMV
immediate early enhancer/promoter (hatched arrow), an
SV40 intron (clear bar), an E. coli beta-galactosidase
cDNA (LacZ) (hatched line), an SV40 polyadenylation
signal (clear bar), a 3' AAV ITR (solid bar), adenovirus
sequence from map units 9-16 (clear bar), and a portion
of a pBR322 derivative plasmid (thin solid line).
Restriction endonuclease enzymes are identified by their
conventional designations; and the location of each
restriction enzyme is identified by the nucleotide
number in parentheses to the right of the enzyme
designation.

Fig. 1B is a schematic drawing demonstrating
linearization of pAd.AV.CMVLacZ [SEQ ID NO: 1] by
digestion with restriction enzyme NheI and a linear
arrangement of a ClaI digested adenovirus type 5 with
deletions from mu 0-1. The area where homologous
recombination will occur (between m.u. 9-16) in both the
plasmid and adenovirus sequences is indicated by crossed
lines.

Fig. 1C is a schematic drawing which
demonstrates the hybrid virus Ad.AV.CMVLacZ after
co-transfection of the linearized pAd.AV.CMVLacZ [SEQ ID NO:
1] and adenovirus into 293 cells followed by
intracellular homologous recombination.

Figs. 2A-2K report the top DNA strand of the
double-strand plasmid pAd.AV.CMVLacZ [SEQ ID NO: 1] (the
complementary strand can be readily derived by one of
skill in the art). With reference to SEQ ID NO: 1,
nucleotides 1-365 are adenovirus type 5 sequences; the 5'
AAV ITR sequence spans nucleotides 366-538; the CMV
promoter/enhancer spans nucleotides 563-1157; the SV-40
intron spans nucleotides 1158-1179; the LacZ gene spans
nucleotides 1356-4827; the SV-40 poly A sequence spans
nucleotides 4839-5037; the 3' AAV ITR spans nucleotides
5053 to 5221; nucleotides 5221 to about 8100 are
adenovirus type 5 sequences. The remaining sequences are
non-specific/plasmid sequences.

Fig. 3 is a bar graph plotting u.v. absorbance
at 420 nm of the beta-galactosidase blue color for a
control and ten putative positive clones (D1A through
D1J) of 293 cells transfected with the recombinant hybrid
Ad.AV.CMVLacZ. Eight of the clones expressed high levels
of enzyme.

Fig. 4 is a schematic diagram of pRep78/52 [SEQ
ID NO: 2]. This plasmid includes an AAV P5 promoter,
Rep78, Rep52 and a poly-A sequence in a pUC18 plasmid
background.

Figs. 5A-5E report nucleotides 1-4910 of the
top DNA strand of the double-strand plasmid pRep78/52
[SEQ ID NO: 2] (the complementary strand can be readily
derived by one of skill in the art).

Fig. 6 is a flow diagram of the construction of
a trans-infection particle formed by a hybrid virus, a
poly-L-lysine sequence and attached AAV rep-containing
plasmid.

Fig. 7 is a flow diagram of the hybrid virus'
life cycle, in which a trans-infection particle enters
the cell and is transported to the nucleus. The virus is
uncoated and the rep mediates rescue of the inserted
gene, which is then integrated into the chromosome of the
host cell.
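
As an illustration of the coordinates listed for SEQ ID NO: 1 in the
description of Figs. 2A-2K, the following sketch tabulates the
features and computes their lengths. The feature labels, the helper
names, and the use of 8100 as an end point (the text says "about
8100") are assumptions made for illustration only.

```python
# Illustrative sketch only. Coordinates are those listed for SEQ ID NO: 1
# in the description of Figs. 2A-2K; labels are hypothetical, and 8100 is
# an approximate end point ("about 8100" in the text).
FEATURES = [
    ("Ad5 sequence (left)", 1, 365),
    ("5' AAV ITR", 366, 538),
    ("CMV promoter/enhancer", 563, 1157),
    ("SV-40 intron", 1158, 1179),
    ("LacZ gene", 1356, 4827),
    ("SV-40 poly A", 4839, 5037),
    ("3' AAV ITR", 5053, 5221),
    ("Ad5 sequence (right)", 5221, 8100),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def minigene_span(features):
    """Return (start, end, length) from the 5' AAV ITR to the 3' AAV ITR."""
    coords = {name: (start, end) for name, start, end in features}
    start = coords["5' AAV ITR"][0]
    end = coords["3' AAV ITR"][1]
    return start, end, feature_length(start, end)


if __name__ == "__main__":
    for name, start, end in FEATURES:
        print(f"{name:24s} {start:5d}-{end:5d} {feature_length(start, end):5d} nt")
    print("ITR-to-ITR span:", minigene_span(FEATURES))
```

Tabulating the coordinates this way makes it straightforward to check
that the listed features appear in the expected order and to estimate
the size of the ITR-flanked region.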
Detailed Description of the Invention

The present invention provides a unique gene
transfer vehicle which overcomes many of the limitations
of prior art viral vectors. This engineered hybrid virus
contains selected adenovirus domains and selected AAV
domains as well as a selected transgene and regulatory
elements in a viral capsid. This novel hybrid virus
solves the problems observed with other, conventional
gene therapy viruses, because it is characterized by the
ability to provide extremely high levels of transgene
delivery to virtually all cell types (conferred by its
adenovirus sequence) and the ability to provide stable
long-term transgene integration into the host cell
(conferred by its AAV sequences). The adenovirus-AAV
hybrid virus of this invention has utility both as a
novel gene transfer vehicle and as a reagent in a method
for large-scale recombinant AAV production.

In a preferred embodiment, a trans-infection
particle or hybrid virus conjugate composed of the hybrid
Ad/AAV virus conjugated to a rep expression plasmid via a
poly-lysine bridge is provided. This trans-infection
particle is advantageous because the adenovirus carrier
can be grown to titers sufficient for high MOI infections
of a large number of cells, the adenoviral genome is
efficiently transported to the nucleus in nondividing
cells as a complex facilitating transduction into
mitotically quiescent cells, and incorporation of the rep
plasmid into the trans-infection particle provides high
but transient expression of rep that is necessary for
both rescue of rAAV DNA and efficient and site-specific
integration.
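
The relevance of high MOI infections can be illustrated with a simple
sketch. It assumes that the number of particles entering each cell
follows a Poisson distribution with mean equal to the MOI; this is a
common simplification and is not a model stated in this
specification. The function name is hypothetical.

```python
import math


def fraction_infected(moi: float) -> float:
    """Expected fraction of cells receiving at least one particle.

    Assumes a Poisson distribution of particles per cell with mean `moi`
    (an illustrative simplification, not taken from the specification).
    """
    return 1.0 - math.exp(-moi)


if __name__ == "__main__":
    for moi in (0.1, 1, 5, 10):
        print(f"MOI {moi:>4}: {fraction_infected(moi):.4f}")
```

Under this assumption, the fraction of cells that receive no particle
falls off exponentially as the MOI increases, which is why a carrier
that can be grown to high titer is useful when a large number of
cells must be transduced.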
I. Construction of the Hybrid Vector and Virus

The hybrid virus of this invention uses
adenovirus nucleic acid sequences as a shuttle to deliver
a recombinant AAV/transgene genome to a target cell. The
DNA sequences of a number of adenovirus types, including
type Ad5, are available from Genbank. The adenovirus
sequences may be obtained from any known adenovirus type,
including the presently identified 41 human types
[Horwitz et al, cited above]. Similarly adenoviruses
known to infect other animals may also be employed in the
vector constructs of this invention. The selection of
the adenovirus type is not anticipated to limit the
following invention. A variety of adenovirus strains are
available from the American Type Culture Collection,
Rockville, Maryland, or available by request from a
variety of commercial and institutional sources. In the
following exemplary embodiment an adenovirus, type 5
(Ad5) is used for convenience.

The adenovirus nucleic acid sequences
employed in the hybrid vector of this invention can range
from a minimum sequence amount, which requires the use of
a helper virus to produce the hybrid virus particle, to
only selected deletions of adenovirus genes, which
deleted gene products can be supplied in the hybrid viral
production process by a selected packaging cell.
Specifically, at a minimum, the adenovirus nucleic acid
sequences employed in the pAdΔ shuttle vector of this
invention are adenovirus genomic sequences from which all
viral genes are deleted and which contain only those
adenovirus sequences required for packaging adenoviral
genomic DNA into a preformed capsid head. More
specifically, the adenovirus sequences employed are the
cis-acting 5' and 3' inverted terminal repeat (ITR)
sequences of an adenovirus (which function as origins of
replication) and the native 5' packaging/enhancer domain,
that contains sequences necessary for packaging linear Ad
genomes and enhancer elements for the E1 promoter.
According to this invention, the entire adenovirus 5'
sequence containing the 5' ITR and packaging/enhancer
region can be employed as the 5' adenovirus sequence in
the hybrid virus. This left terminal (5') sequence of
the Ad5 genome useful in this invention spans bp 1 to
about 360 of the conventional adenovirus genome, also
referred to as map units 0-1 of the viral genome, and
generally is from about 353 to about 360 nucleotides in
length. This sequence includes the 5' ITR (bp 1-103 of
the adenovirus genome), and the packaging/enhancer domain
(bp 194-358 of the adenovirus genome). Preferably, this
native adenovirus 5' region is employed in the hybrid
virus and vector in unmodified form. Alternatively,
corresponding sequences from other adenovirus types may
be substituted. These Ad sequences may be modified to
contain desired deletions, substitutions, or mutations,
provided that the desired function is not eliminated.

The 3' adenovirus sequences of the hybrid virus
include the right terminal (3') ITR sequence of the
adenoviral genome spanning about bp 35,353 - end of the
adenovirus genome, or map units ~98.4-100. This sequence
is generally about 580 nucleotides in length. This entire
sequence is desirably employed as the 3' sequence of a
hybrid virus. Preferably, the native adenovirus 3'
region is employed in the hybrid virus in unmodified
form. However, as described above with respect to the 5'
sequences, some modifications to these sequences which do
not adversely affect their biological function may be
acceptable. As described below, when these 5' and 3'
adenovirus sequences are employed in the hybrid vector, a
helper adenovirus which supplies all other essential
genes for viral formation alone or with a packaging cell
line is required in the production of the hybrid virus or
viral particle.

Alternative embodiments of the hybrid
virus employ adenovirus sequences in addition to the
minimum sequences, but which contain deletions of all or
portions of adenovirus genes. For example, the
adenovirus immediate early gene E1a (which spans mu 1.3
to 4.5) and delayed early gene E1b (which spans mu 4.6 to
11.2) should be deleted from the adenovirus sequence
which forms a part of the hybrid vector construct and
virus. Alternatively, if these sequences are not
completely eliminated, at least a sufficient portion of
the E1a and E1b sequences must be deleted so as to render
the virus replication defective. These deletions,
whether complete or partial, which eliminate the
biological function of the gene are termed "functional
deletions" herein.

Additionally, all or a portion of the
adenovirus delayed early gene E3 (which spans mu 76.6 to
86.2) may be eliminated from the adenovirus sequence
which forms a part of the hybrid virus. The function of
E3 is irrelevant to the function and production of the
hybrid virus.

All or a portion of the adenovirus delayed
early gene E2a (which spans mu 67.9 to 61.5) may be
eliminated from the hybrid virus. It is also anticipated
that portions of the other delayed early genes E2b (which
spans mu 29 to 14.2) and E4 (which spans mu 96.8 to 91.3)
may also be eliminated from the hybrid virus and from the
vector.

Deletions may also be made in any of the
late genes L1 through L5, which span mu 16.45 to 99 of
the adenovirus genome. Similarly, deletions may be
useful in the intermediate genes IX, which maps between mu
9.8 and 11.2, and IVa2, which maps between mu 16.1 and 11.1.
Other deletions may occur in the other structural or
non-structural adenovirus genes.

The above discussed deletions may occur
individually, i.e., an adenovirus sequence for use in the
present invention may contain deletions of E1 only.
Alternatively, deletions of entire genes or portions
effective to destroy their biological activity may occur
in any combination. For example, in one exemplary hybrid
vector, the adenovirus sequence may contain deletions of
the E1 genes and the E3 gene, or of the E1, E2a and E3
genes, or of the E1 and E4 genes, or of E1, E2a and E4
genes, with or without deletion of E3, and so on.

The more deletions in the adenovirus
sequence up to the minimum sequences identified above
that characterize the hybrid virus, the larger the
sequence(s) of the other below-described components to be
inserted in the hybrid vector. As described above for
the minimum adenovirus sequences, those gene sequences
not present in the adenovirus portion of the hybrid virus
must be supplied by either a packaging cell line and/or a
helper adenovirus to generate the hybrid virus.

In an exemplary hybrid virus of this invention
which is described below and in Example 1, the adenovirus
genomic sequences present are from mu 0 to 1, mu 9 to
78.3 and mu 86 to 100 (deleted sequences eliminate the
E1a and E1b genes and a portion of the E3 gene). From
the foregoing information, it is expected that one of
skill in the art may construct hybrid vectors and viruses
containing more or less of the adenovirus gene sequence.
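
The map-unit intervals given for the exemplary hybrid virus can be
turned into approximate sizes with a short sketch. It assumes the
approximate conversion of 360 bp per map unit stated in the
Background; the helper names are hypothetical and the resulting
base-pair figures are estimates only.

```python
# Illustrative sketch only. BP_PER_MU is the approximate value from the
# Background (36 kb / 100 map units); results are estimates.
BP_PER_MU = 360

# Adenovirus intervals present in the exemplary hybrid virus (map units).
RETAINED_MU = [(0.0, 1.0), (9.0, 78.3), (86.0, 100.0)]


def total_mu(intervals):
    """Sum the lengths of a list of (start, end) map-unit intervals."""
    return sum(end - start for start, end in intervals)


def deleted_intervals(retained, genome_end=100.0):
    """Return the gaps between retained intervals across the genome."""
    gaps = []
    pos = 0.0
    for start, end in sorted(retained):
        if start > pos:
            gaps.append((pos, start))
        pos = max(pos, end)
    if pos < genome_end:
        gaps.append((pos, genome_end))
    return gaps


if __name__ == "__main__":
    deleted = deleted_intervals(RETAINED_MU)
    print("deleted intervals (m.u.):", deleted)
    print("retained, approx. bp:", round(total_mu(RETAINED_MU) * BP_PER_MU))
    print("deleted, approx. bp:", round(total_mu(deleted) * BP_PER_MU))
```

Under these assumptions the deleted intervals are mu 1 to 9 and mu
78.3 to 86, which together correspond to roughly 5.7 kb of adenovirus
sequence.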

The portions of the adenovirus genome in
the hybrid virus permit the virus to be produced at high
titers, often greater than 1 x 10^15 pfu/ml.
This is in stark contrast to the low titers (1 x 10^6
pfu/ml) that have been found for recombinant AAV.
B. The AAV Components of the Vector and Virus

Also part of the hybrid vectors and
viruses of this invention are sequences of an
adeno-associated virus. The sequences useful in the hybrid
vector are the viral sequences from which the rep and cap
polypeptide encoding sequences are deleted. More
specifically, the sequences employed are the cis-acting
5' and 3' inverted terminal repeat (ITR) sequences
[See, e.g., B. J. Carter, in "Handbook of Parvoviruses",
ed., P. Tijssen, CRC Press, pp. 155-168 (1990)]. As
stated above, the ITR sequences are about 143 bp in
length. Substantially the entire sequences encoding the
ITRs are used in the vectors, although some degree of
minor modification of these sequences is expected to be
permissible for this use. See, e.g., WO 93/24641,
published December 9, 1993. The ability to modify these
ITR sequences is within the skill of the art. For
suitable techniques, see, e.g., texts such as Sambrook et
al, "Molecular Cloning. A Laboratory Manual.", 2d edit.,
Cold Spring Harbor Laboratory, New York (1989).

The AAV ITR sequences may be obtained from
any known AAV, including presently identified human
types. Similarly, AAVs known to infect other animals may
also be employed in the vector constructs of this
invention. The selection of the AAV is not anticipated
to limit the following invention. A variety of AAV
strains and types are available from the American Type
Culture Collection or available by request from a variety
of commercial and institutional sources. In the
following exemplary embodiment an AAV is used for
convenience.

In the hybrid vector construct, the AAV
sequences are flanked by the selected adenovirus
sequences discussed above. The 5' and 3' AAV ITR
sequences themselves flank a selected transgene sequence
and associated regulatory elements, described below.
The sequence formed by the transgene and flanking
5' and 3' AAV sequences may be inserted at any deletion
site in the adenovirus sequences of the vector. For
example, the AAV sequences are desirably inserted at the
site of the deleted E1a/E1b genes of the adenovirus,
i.e., after map unit 1. Alternatively, the AAV sequences
may be inserted at an E3 deletion, E2a deletion, and so
on. If only the adenovirus 5' ITR/packaging sequences
and 3' ITR sequences are used in the hybrid virus, the
AAV sequences are inserted between them.

C. The Transgene Component of the Hybrid Vector and Virus

The transgene sequence of the vector and
recombinant virus is a nucleic acid sequence or reverse
transcript thereof, heterologous to the adenovirus
sequence, which encodes a peptide or protein of
interest. The transgene is operatively linked to
regulatory components in a manner which permits transgene
transcription.

The composition of the transgene sequence
will depend upon the use to which the resulting hybrid
vector will be put. For example, one type of transgene
sequence includes a reporter sequence, which upon
expression produces a detectable signal. Such reporter
sequences include without limitation an E. coli beta-
galactosidase cDNA, an alkaline phosphatase gene
and a green fluorescent protein gene. These sequences,
when associated with regulatory elements which drive
their expression, provide signals detectable by
conventional means, e.g., ultraviolet wavelength
absorbance, visible color change, etc.

Another type of transgene sequence
includes a therapeutic gene which expresses a desired
gene product in a host cell. These therapeutic genes or
nucleic acid sequences typically encode products for
administration and expression in a patient in vivo or ex
vivo to replace or correct an inherited or non-inherited
genetic defect or treat an epigenetic disorder or
disease. Such therapeutic genes which are desirable for
the performance of gene therapy include, without
limitation, a normal cystic fibrosis transmembrane
regulator (CFTR) gene, a low density lipoprotein (LDL)
gene, and a number of genes which may be readily selected
by one of skill in the art. The selection of the
transgene is not considered to be a limitation of this
invention, as such selection is within the knowledge of
those skilled in the art.

D. Regulatory Elements of the Hybrid Vector

In addition to the major elements
identified above for the hybrid vector, i.e., the
adenovirus sequences, AAV sequences and the transgene,
the vector also includes conventional regulatory elements
necessary to drive expression of the transgene in a cell
transfected with the hybrid vector. Thus the vector
contains a selected promoter which is linked to the
transgene and located, with the transgene, between the
AAV ITR sequences of the vector.

Selection of the promoter is a routine
matter and is not a limitation of the hybrid vector
itself. Useful promoters may be constitutive promoters
or regulated (inducible) promoters, which will enable
control of the amount of the transgene to be expressed.
For example, a desirable promoter is that of the
cytomegalovirus immediate early promoter/enhancer [see,
e.g., Boshart et al, Cell, 41:521-530 (1985)]. Other
desirable promoters include, without limitation, the Rous
sarcoma virus LTR promoter/enhancer and the chicken
beta-actin promoter. Still other promoter/enhancer sequences
may be selected by one of skill in the art.

The vectors will also desirably contain
nucleic acid sequences heterologous to the adenovirus
sequences including sequences providing signals required
for efficient polyadenylation of the transcript and
introns with functional splice donor and acceptor sites.
A common poly-A sequence which is employed in the
exemplary vectors of this invention is that derived from
the papovavirus SV-40. The poly-A sequence generally is
inserted in the vector following the transgene sequences
and before the 3' AAV ITR sequence. A common intron
sequence is also derived from SV-40, and is referred to
as the SV-40 T intron sequence. A hybrid vector of the
present invention may also contain such an intron,
desirably located between the promoter/enhancer sequence
and the transgene. Selection of these and other common
vector elements is conventional and many such sequences
are available [see, e.g., Sambrook et al, and references
cited therein]. The DNA sequences encoding such
regulatory regions are provided in the plasmid sequence
of Fig. 2 [SEQ ID NO: 1].

The combination of the transgene,
promoter/enhancer, the other regulatory vector elements
and the flanking 5' and 3' AAV ITRs are referred to as a
"minigene" for ease of reference herein. As above
stated, the minigene is located in the site of any
selected adenovirus deletion in the hybrid virus. The
size of this minigene depends upon the amount and number
of adenovirus sequence deletions referred to above. Such
a minigene may be about 8 kb in size in the exemplary
virus deleted in the E1 and E3 genes, as described in the
examples below. Alternatively, if only the minimum
adenovirus sequences are employed in the virus, this
minigene may be a size up to about 30 kb. Thus, this
hybrid vector and virus permit a great deal of latitude
in the selection of the various components of the
minigene, particularly the transgene, with regard to
size. Provided with the teachings of this invention, the
design of such a minigene can be made by resort to
conventional techniques.

E. Hybrid Vector Assembly and Production of Hybrid Virus

The material from which the sequences used
in the hybrid vector, helper viruses, if needed, and
recombinant hybrid virus (or viral particle) are derived
and the various vector components and sequences employed
in the construction of the hybrid vectors of this
invention are obtained from commercial or academic
sources based on previously published and described
materials. These materials may also be obtained from an
individual patient or generated and selected using
standard recombinant molecular cloning techniques known
and practiced by those skilled in the art. Any
modification of existing nucleic acid sequences forming
the vectors and viruses, including sequence deletions,
insertions, and other mutations are also generated using
standard techniques.

Assembly of the selected DNA sequences of
the adenovirus, the AAV and the reporter genes or
therapeutic genes and other vector elements into the
hybrid vector and the use of the hybrid vector to produce
a hybrid virus utilize conventional techniques, such as
described in Example 1. Such techniques include
conventional cloning techniques of cDNA such as those
described in texts [Sambrook et al, cited above], use of
overlapping oligonucleotide sequences of the adenovirus
and AAV genomes, polymerase chain reaction, and any
suitable method which provides the desired nucleotide
sequence. Standard transfection and co-transfection
techniques are employed, e.g., CaPO4 transfection
techniques using the complementation human embryonic
kidney (HEK) 293 cell line (a human kidney cell line
containing a functional adenovirus E1a gene which
provides a transacting E1a protein). Other conventional
methods employed in this invention include homologous
recombination of the viral genomes, plaquing of viruses
in agar overlay, methods of measuring signal generation,
and the like.

As described in detail in Example 1 below
and with resort to Fig. 1, a unique hybrid virus of this
invention is prepared which contains an E1-deleted,
partially E3 deleted, adenovirus sequence associated with
a single copy of a recombinant AAV having deletions of
its rep and cap genes and encoding a selected reporter
transgene. Briefly, this exemplary hybrid virus was
designed such that the AV.CMVLacZ sequence [SEQ ID NO: 1]
(a minigene containing a 5' AAV ITR, a CMV promoter, an
SV-40 intron, a LacZ transgene, an SV-40 poly-A sequence
and a 3' AAV ITR) was positioned in place of the
adenovirus type 5 (Ad5) E1a/E1b genes, making the
adenovirus vector replication defective.

Because of the limited amount of
adenovirus sequence present in the hybrid vectors of this
invention, including the pAV.CMVLacZ [SEQ ID NO: 1]
above, a packaging cell line or a helper adenovirus or
both may be necessary to provide sufficient adenovirus
gene sequences necessary for a productive viral infection
to generate the hybrid virus.

Helper viruses useful in this invention
contain selected adenovirus gene sequences not present in
the hybrid vector construct or expressed by the cell line
in which the hybrid vector is transfected. Optionally,
such a helper virus may contain a second reporter
minigene which enables separation of the resulting hybrid
virus and the helper virus upon purification. The
construction of desirable helper viruses is within the
skill of the art.

As one example, if the cell line employed
to produce the recombinant virus is not a packaging cell
line, and the hybrid vector contains only the minimum
adenovirus sequences identified above, the helper virus
may be a wild type Ad virus. Thus, the helper virus
supplies the necessary adenovirus early genes E1, E2a, E4
and all remaining late, intermediate, structural and
non-structural genes of the adenovirus genome. However, if,
in this situation, the packaging cell line is 293, which
supplies the E1 proteins, the helper virus need not
contain the E1 gene.

In another embodiment, when the hybrid
construct is rendered replication defective by a
functional deletion in E1 but contains no other deletions
in Ad genes necessary for production of an infective
viral particle, and the 293 cell line is employed, no
helper virus is necessary for production of the hybrid
virus. Additionally, all or a portion of the adenovirus
delayed early gene E3 (which spans mu 76.6 to 86.2) may
be eliminated from the helper virus useful in this
invention because this gene product is not necessary for
the formation of a functioning hybrid virus particle.

It should be noted that one of skill in
the art may design other helper viruses or develop other
packaging cell lines to complement the adenovirus
deletions in the vector construct and enable production
of the hybrid virus particle, given this information.
Therefore, this invention is not limited by the use or
description of any particular helper virus or packaging
cell line.
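
The complementation logic described above can be sketched as simple set arithmetic. This is an illustrative simplification only: the gene-group labels (including the catch-all "late/other") and the function name are assumptions made for this sketch and are not part of the specification.

```python
# Simplified gene groups needed for a productive infection.
# The labels, and lumping all remaining genes into "late/other",
# are assumptions made for illustration.
REQUIRED = {"E1", "E2a", "E4", "late/other"}

# The 293 packaging cell line supplies the E1 proteins.
HEK293 = {"E1"}


def helper_must_supply(vector_genes, cell_line_genes):
    """Return the gene groups that neither the hybrid vector nor the
    cell line provides, i.e. what a helper virus has to contribute."""
    return REQUIRED - set(vector_genes) - set(cell_line_genes)


# Case 1: minimal vector, non-packaging cell line -> helper supplies everything.
case1 = helper_must_supply(set(), set())

# Case 2: minimal vector in 293 cells -> helper need not contain E1.
case2 = helper_must_supply(set(), HEK293)

# Case 3: vector deleted only in E1, grown in 293 cells -> no helper needed.
case3 = helper_must_supply(REQUIRED - {"E1"}, HEK293)

for label, genes in (("case 1", case1), ("case 2", case2), ("case 3", case3)):
    print(label, sorted(genes))
```

An empty result corresponds to the situation in which no helper virus is necessary.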

Thus, as described in Figs. 1A through 1C,
the circular plasmid pAd.AV.CMVLacZ [SEQ ID NO: 1]
(containing the minigene and only adenovirus sequences
from map unit 0 to 1 and 9 to 16) was digested and
co-transfected with a selected Ad5 helper virus (containing
adenovirus sequences 9 to 78.4 and 86 to 100) into 293
cells. Thus, the packaging cell line provides the E1
proteins and the helper virus provides all necessary
adenovirus gene sequences subsequent to map unit 16.
Homologous recombination occurs between the helper virus
and the hybrid vector, resulting in the hybrid viral
particle. Growth of this hybrid viral particle in 293
cells has been closely monitored for greater than 20
rounds of amplification with no indication of genome
instability. Rescue and integration of the transgene
from the hybrid virus into a host cell and further
modifications of the vector are described below. The
resulting hybrid virus Ad.AV.CMVLacZ combines the high
titer potential of adenovirus with the integrating
biology associated with AAV latency.
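
As a rough illustration of the map-unit coordinates used above, the following sketch converts map units to base pairs using the approximate figure of 360 bp per map unit given in the Background, and finds the region shared by the hybrid vector and the helper virus, which is where homologous recombination can occur. The function names are hypothetical and the conversion is approximate.

```python
BP_PER_MAP_UNIT = 360  # approximate: ~36 kb genome divided into 100 map units


def mu_to_bp(map_units):
    """Convert a length or position in map units to approximate base pairs."""
    return round(map_units * BP_PER_MAP_UNIT)


def overlap(a, b):
    """Return the overlapping interval of two (start, end) ranges, or None."""
    lo, hi = max(a[0], b[0]), min(a[1], b[1])
    return (lo, hi) if lo < hi else None


# Adenovirus sequences (in map units) as stated in the text.
vector_regions = [(0, 1), (9, 16)]
helper_regions = [(9, 78.4), (86, 100)]

shared = []
for v in vector_regions:
    for h in helper_regions:
        o = overlap(v, h)
        if o is not None:
            shared.append(o)

for start, end in shared:
    print(f"shared region: mu {start}-{end}, about {mu_to_bp(end - start)} bp")
```

Under these assumptions the only shared region is map units 9 to 16, roughly 2.5 kb of homology.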

C. Hybrid Virus Polycation Conjugates

Rep expression is required for rescue of
the rAAV genome to occur. A preferred approach is to
synthetically incorporate a plasmid permitting expression
of rep into the hybrid particle. To do so, the hybrid
viruses described above are further modified by resort to
adenovirus-polylysine conjugate technology. See, e.g.,
Wu et al, J. Biol. Chem., 264:16985-16987 (1989); and K.
J. Fisher and J. M. Wilson, Biochem. J., 299: 49 (April
1, 1994), incorporated herein by reference. Using this
technology, a hybrid virus as described above is modified
by the addition of a poly-cation sequence distributed
around the capsid of the hybrid viral particle.
Preferably, the poly-cation is poly-lysine, which
attaches around the negatively-charged virus to form an
external positive charge. A plasmid containing the AAV
rep gene (or a functional portion thereof) under the
control of a suitable promoter is then complexed directly
to the hybrid capsid, resulting in a single viral
particle containing the hybrid virus and an AAV rep gene.
The negatively charged plasmid binds with high
affinity to the positively charged polylysine.
Essentially the techniques employed in constructing this
hybrid virus conjugate or trans-infection particle are as
described in detail in Example 3 below.

An alternative embodiment of the hybrid
vector and resulting viral particle is provided by
altering the rep containing plasmid to also contain an
AAV cap gene. This embodiment of the hybrid vector when
in a host cell is thus able to produce a recombinant AAV
particle, as discussed in more detail below.

The plasmids employed in these embodiments
contain conventional plasmid sequences, which place a
selected AAV sequence, i.e., rep and/or cap gene
sequences, under the control of a selected promoter. In
the example provided below, the exemplary plasmid is
pRep78/52 [SEQ ID NO: 2], a transacting plasmid
containing the AAV sequences that encode rep 78 kD and 52
kD proteins under the control of the AAV P5 promoter.
The plasmid also contains an SV40 polyadenylation signal.
The DNA sequence of this plasmid is provided in Fig. 8
[SEQ ID NO: 2].

In a similar manner and with resort to
plasmid and vector sequences known to the art, analogous
plasmids may be designed using both rep and cap genes,
and different constitutive or regulated promoters,
optional poly-A sequences and introns.

The availability of materials to make
these modified hybrid vectors and viruses and the AAV rep
and/or cap containing vectors, and the techniques involved
in the assembly of the hybrid vector and rep and/or cap
containing plasmids, are conventional as described above.
The assembly techniques for the trans-infection particle
employ the techniques described above for the hybrid
vector and the techniques of Wu et al and Fisher et al,
cited above. The use of this trans-infection particle,
including rescue and integration of the transgene into
the host cell, is described below.

II. Function of the Hybrid Virus 

A. The Hybrid Virus Infects a Target Cell

Once the hybrid virus or trans-infection
particle is constructed as discussed above, it is
targeted to, and taken up by, a selected target cell.
The selection of the target cell also depends upon the
use of the hybrid virus, i.e., whether or not the
transgene is to be replicated in vitro for production of
a recombinant AAV particle, or ex vivo for production
into a desired cell type for redelivery into a patient,
or in vivo for delivery to a particular cell type or
tissue. Target cells may therefore be any mammalian cell
(preferably a human cell). For example, in in vivo use,
the hybrid virus can target to any cell type normally
infected by adenovirus, depending upon the route of
administration, i.e., it can target, without limitation,
neurons, hepatocytes, epithelial cells and the like.
Uptake of the hybrid virus by the cell is caused by the
infective ability contributed to the vector by the
adenovirus and AAV sequences.

B. The Transgene is Rescued

Once the hybrid virus or trans-infection
particle is taken up by a cell, the AAV ITR flanked
transgene must be rescued from the parental adenovirus
backbone. Rescue of the transgene is dependent upon
supplying the infected cell with an AAV rep gene. Thus,
efficacy of the hybrid virus can be measured in terms of
rep mediated rescue of rAAV from the parental adenovirus
template.

The rep genes can be supplied to the
hybrid virus by several methods. One embodiment for
providing rep proteins in trans was demonstrated with the
exemplary hybrid virus Ad.AV.CMVLacZ by transfecting into
the target monolayer of cells previously infected with
the hybrid vector, a liposome enveloped plasmid pRep78/52
[SEQ ID NO: 2] containing the genes encoding the AAV rep
78 kDa and 52 kDa proteins under the control of the AAV
P5 promoter. Rescue and amplification of a
double-stranded AAV monomer and a double-stranded AAV dimer,
each containing the LacZ transgene described above, was
observed in 293 cells. This is described in detail in
Example 2.

The production of rep in trans can be
modulated by the choice of promoter in the rep containing
plasmid. If high levels of rep expression are important
early for rescue of the recombinant AAV domain, a
heterologous (non-adenovirus, non-AAV) promoter may be
employed to drive expression of rep and eliminate the
need for E1 proteins. Alternatively, the low levels of
rep expression from P5 that occur in the absence of
adenovirus E1 proteins may be sufficient to initiate
rescue and optimal to drive integration of the
recombinant AAV genome in a selected use.

More preferably for in vivo use, the AAV
rep gene may also be delivered as part of the hybrid
virus. One embodiment of this single particle concept is
the polycation conjugated hybrid virus (see Fig. 7).
Infection of this trans-infection particle is
accomplished in the same manner and with regard to the
same target cells as identified above. The polylysine
conjugate of the hybrid virus, onto which was directly
complexed a plasmid that encoded the rep 78 and 52
proteins, combines all of the functional components into
a single particle structure. Thus, the trans-infection
particle permits delivery of a single particle to the
cell, which is considerably more desirable for
therapeutic use. Similar experiments to demonstrate
rescue of the transgene from the hybrid conjugate
trans-infection particle in 293 cells and in HeLa cells are
detailed in Example 4.

In another embodiment, the hybrid virus is
modified by cloning the rep cDNA directly into the
adenovirus genome portion of the hybrid vector. Because
it is known that even residual levels of rep expression
can interfere with replication of adenovirus DNA, such
incorporation of rep into the hybrid vector itself is
anticipated to require possible mutation of the rep
genes to encode only selected domains, and the use of
inducible promoters to regulate rep expression, as well
as careful placement of the rep genes into the adenovirus
sequences of the hybrid vector.

C. Transgene Integrates into Chromosome

Once uncoupled (rescued) from the genome
of the hybrid virus, the recombinant AAV/transgene
minigene seeks an integration site in the host chromatin
and becomes integrated therein, providing stable
expression of the accompanying transgene in the host
cell. This aspect of the function of the hybrid virus is
important for its use in gene therapy. The AAV/transgene
minigene sequence rescued from the hybrid virus
achieves provirus status in the target cell, i.e., the
final event in the hybrid lifecycle (Fig. 7).

To determine whether the AAV minigene
rescued from the hybrid virus achieves provirus status in
a target cell, non-E1 expressing HeLa cells were infected
with the hybrid vector-poly-Lysine conjugate complexed
with pRep78/52 [SEQ ID NO: 2] and passaged until stable
colonies of LacZ expressing cells were evident. A
duplicate plate of cells was infected with the same
conjugate which, instead of being complexed with the
pRep78/52 plasmid [SEQ ID NO: 2], carried an irrelevant
plasmid. Cells that received the rep containing hybrid
particle produced a greater number of stable LacZ
positive colonies than cells infected with the control
vector. This indicates multiple rescue and integration
events in cells that expressed rep proteins.
Confirmation of integration is revealed by characterizing
the recombinant AAV genome in the hybrid infected cells
and identifying flanking chromosomal sequences (see
Example 5).

III. Use of the Hybrid Viruses and Viral Particles in Gene Therapy

The novel hybrid virus and trans-infection
particles of this invention provide efficient gene
transfer vehicles for somatic gene therapy. These hybrid
viruses are prepared to contain a therapeutic gene in
place of the LacZ reporter transgene illustrated in the
exemplary vector. By use of the hybrid viruses and
trans-infection particles containing therapeutic
transgenes, these transgenes can be delivered to a
patient in vivo or ex vivo to provide for integration of
the desired gene into a target cell. Thus, these hybrid
viruses and trans-infection particles can be employed to
correct genetic deficiencies or defects. Two examples of
the generation of gene transfer vehicles for the
treatment of cystic fibrosis and familial
hypercholesterolemia are described in Examples 6 and 7
below. One of skill in the art can generate any number
of other gene transfer vehicles by including a selected
transgene for the treatment of other disorders. For
example, the trans-infection particles are anticipated to
be particularly advantageous in ex vivo gene therapy
where transduction and proviral integration in a stem
cell is desired, such as in bone marrow directed gene
therapy.

The hybrid viruses and trans-infection
particles of the present invention may be administered to
a patient, preferably suspended in a biologically
compatible solution or pharmaceutically acceptable
delivery vehicle. A suitable vehicle includes sterile
saline. Other aqueous and non-aqueous isotonic sterile
injection solutions and aqueous and non-aqueous sterile
suspensions known to be pharmaceutically acceptable
carriers and well known to those of skill in the art may
be employed for this purpose.

The hybrid viruses and trans-infection
particles of this invention may be administered in
sufficient amounts to transfect the desired cells and
provide sufficient levels of integration and expression
of the selected transgene to provide a therapeutic
benefit without undue adverse effects or with medically
acceptable physiological effects which can be determined
by those skilled in the medical arts. Conventional and
pharmaceutically acceptable routes of administration
include direct delivery to the target organ, tissue or
site, intranasal, intravenous, intramuscular,
subcutaneous, intradermal, oral and other parenteral routes
of administration. Routes of administration may be
combined, if desired.

Dosages of the hybrid virus and/or
trans-infection particle will depend primarily on factors such
as the condition being treated, the selected gene, the
age, weight and health of the patient, and may thus vary
among patients. A therapeutically effective human dose
of the hybrid viruses or trans-infection particles of the
present invention is believed to be in the range of from
about 20 to about 50 ml of saline solution containing
concentrations of from about 1 x 10^7 to 1 x 10^10 pfu/ml
hybrid virus of the present invention. A preferred human
dose is about 20 ml saline solution at the above
concentrations. The dosage will be adjusted to balance
the therapeutic benefit against any side effects. The
levels of expression of the selected gene can be
monitored to determine the selection, adjustment or
frequency of dosage administration.
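
For orientation only, the volume and concentration ranges stated above can be multiplied out to give the implied total number of plaque forming units per dose. The helper function below is a hypothetical illustration of that arithmetic and is not dosing guidance.

```python
def total_pfu(volume_ml, titer_pfu_per_ml):
    """Total plaque forming units delivered in a given volume."""
    return volume_ml * titer_pfu_per_ml


# Bounds of the stated range: 20-50 ml at 1e7-1e10 pfu/ml.
lowest = total_pfu(20, 1e7)
highest = total_pfu(50, 1e10)

# Preferred volume (about 20 ml) across the same concentration range.
preferred = (total_pfu(20, 1e7), total_pfu(20, 1e10))

print(f"stated range: {lowest:.0e} to {highest:.0e} pfu")
print(f"preferred 20 ml dose: {preferred[0]:.0e} to {preferred[1]:.0e} pfu")
```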

IV. High Efficiency Production of rAAV

The hybrid viruses and trans-infection
particles of this invention have another desirable
utility in the production of large quantities of
recombinant AAV particles. Due to the complicated
current methods for generating AAV, there is only a
limited amount of AAV available for use in industrial,
medical and academic biotechnology procedures. The
vectors and viruses of the present invention provide a
convenient and efficient method for generating large
quantities of rAAV particles.

According to this aspect of the invention, a
trans-infection particle is constructed as described
above and in Example 3 and is employed to produce high
levels of rAAV as detailed in Example 8, with the
possible modifications described in Example 9 below.
Briefly, a plasmid is generated that contains both AAV
rep and cap genes under the control of a suitable promoter
and is complexed to the poly-lysine exterior of the
hybrid virus as described above. This trans-infection
particle is then permitted to infect a selected host
cell, such as 293 cells. The presence of both rep and
cap permits the formation of AAV particles in the cells
and generates an AAV virus titer of about 10^9 virions. In
contrast, current methods involving the transfection of
multiple plasmids produce only about 10^7 virion titer.
The rAAV is isolated from the culture by selecting the
LacZ-containing blue plaques and purifying them on a
cesium chloride gradient.

The benefit of this procedure relates to the
fact that the cis AAV element is encoded by the parental
adenovirus genome. As a result, the trans plasmid is the
only DNA component that is needed for complex formation.
The cell is thereby loaded with significantly more copies
of the trans-acting rep and cap sequences, resulting in
improved efficiency of rescue and packaging.

Numerous comparative studies focusing on the
optimal ratio and copy number of the cis and trans
plasmids for AAV production indicated that there is a
positive correlation between the trans plasmid copy
number and yield of recombinant virus. As described in
detail in Example 8, the yield of recombinant AV.CMVLacZ
virus was increased by 5-10 fold by using the
trans-infection particle instead of a standard adenovirus
vector.

The primary limitation associated with the
production of recombinant AAV using a hybrid virus of
this invention relates to difficulties that arise in
distinguishing between the two viruses (i.e., adenovirus
and AAV) that are produced by the cell. Using the
exemplary vectors and vector components of this
invention, LacZ histochemical staining could not be used
to titer the yield of recombinant AV.CMVLacZ since any
contaminating Ad.AV.CMVLacZ hybrid would contribute to
the final count. Therefore, a rapid Southern blot
technique for quantitating yields of recombinant AAV was
incorporated. The assay that was developed enabled not
only quantitation and verification of AAV production, but
also demonstrated the removal of contaminating hybrid
virus from purified AAV stocks.

Another method for detecting contaminating
hybrid virions involves modifying the hybrid vector by
inserting a small second reporter minigene (i.e.,
reporter gene, promoter and other expression control
sequences, where desired) into the E3 region of the
parental adenovirus backbone. Because this reporter is
not linked to the AAV domain, contaminating hybrid virus
that is present during purification can be monitored by
this hybrid-specific marker. Another possible reporter
gene is the nucleic acid sequence for green fluorescent
protein. With this hybrid vector containing two reporter
sequences, histochemical staining for alkaline
phosphatase (adenovirus reporter) or β-galactosidase (AAV
reporter) activity can be used to monitor each viral
domain.

The following examples illustrate the
construction and testing of the hybrid vectors of the
present invention and the use thereof in the production
of recombinant AAV. These examples are illustrative
only, and do not limit the scope of the present
invention.

Example 1 - Construction of a Hybrid Virus

A first hybrid adenovirus-AAV virus was
engineered by homologous recombination between DNA
extracted from an adenovirus and a complementing vector
according to protocols previously described [see, e.g.,
K. F. Kozarsky et al, J. Biol. Chem., 269:13695-13702
(1994) and references cited therein]. The following
description refers to the diagram of Fig. 1.

Adenovirus DNA was extracted from CsCl purified
dl7001 virions, an Ad5 (serotype subgroup C) variant that
carries a 3 kb deletion between mu 78.4 through 86 in the
nonessential E3 region (provided by Dr. William Wold,
Washington University, St. Louis, Missouri). Adenoviral
DNA was prepared for co-transfection by digestion with
ClaI (adenovirus genomic bp position 917) which removes
the left arm of the genome encompassing adenovirus map
units 0-2.5. See lower diagram of Fig. 1B.

The complementing hybrid vector, pAd.AV.CMVLacZ
(see Fig. 1A and Fig. 2 [SEQ ID NO: 1]) was constructed
as follows:

A parental cloning vector, pAd.BglII, was
designed. It contains two segments of wild-type Ad5
genome (i.e., map units 0-1 and 9-16.1) separated by a
unique BglII cloning site for insertion of heterologous
sequences. The missing Ad5 sequences between the two
domains (adenovirus genome bp 361-3327) result in the
deletion of E1a and the majority of E1b following
recombination with viral DNA.

A recombinant AAV genome (AV.CMVLacZ) was
designed and inserted into the BglII site of pAd.BglII to
generate the complementing plasmid. The linear
arrangement of AV.CMVLacZ [SEQ ID NO: 1] (see top diagram
of Fig. 1B) includes:

(a) the 5' AAV ITR (bp 1-173) obtained by PCR
using pAV2 [C. A. Laughlin et al, Gene, 23: 65-73 (1983)]
as template [nucleotide numbers 365-538 of Fig. 2 [SEQ ID
NO: 1]];

(b) a CMV immediate early enhancer/promoter
[Boshart et al, Cell, 41:521-530 (1985); nucleotide
numbers 563-1157 of Fig. 2 [SEQ ID NO: 1]],

(c) an SV40 splice donor-splice acceptor
(nucleotide numbers 1178-1179 of Fig. 2 [SEQ ID NO: 1]),

(d) E. coli beta-galactosidase cDNA
(nucleotide numbers 1356 - 4827 of Fig. 2 [SEQ ID NO:
1]),

(e) an SV40 polyadenylation signal (a 237 bp
BamHI-BclI restriction fragment containing the
cleavage/poly-A signals from both the early and late
transcription units; nucleotide numbers 4839 - 5037 of
Fig. 2 [SEQ ID NO: 1]) and

(f) 3' AAV ITR, obtained from pAV2 as a
SnaBI-BglII fragment (nucleotide numbers 5053 - 5221 of Fig. 2
[SEQ ID NO: 1]).
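
The linear arrangement listed in (a) through (f) can be captured as a small feature table, which makes it easy to check that the elements are in order and do not overlap. This is only a sketch: the nucleotide coordinates are copied as printed above, while the class name, field names and helper function are assumptions introduced for illustration.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """One element of the AV.CMVLacZ minigene (coordinates per Fig. 2 as printed)."""
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add one.
        return self.end - self.start + 1


FEATURES = [
    Feature("5' AAV ITR", 365, 538),
    Feature("CMV immediate early enhancer/promoter", 563, 1157),
    Feature("SV40 splice donor-splice acceptor", 1178, 1179),
    Feature("E. coli beta-galactosidase cDNA", 1356, 4827),
    Feature("SV40 polyadenylation signal", 4839, 5037),
    Feature("3' AAV ITR", 5053, 5221),
]


def is_ordered_non_overlapping(features):
    """True if each feature ends before the next one starts."""
    return all(a.end < b.start for a, b in zip(features, features[1:]))


# Span from the first base of the 5' ITR to the last base of the 3' ITR.
minigene_span = FEATURES[-1].end - FEATURES[0].start + 1

for f in FEATURES:
    print(f"{f.name:40s} {f.start:5d}-{f.end:<5d} ({f.length} nt)")
print("ordered, non-overlapping:", is_ordered_non_overlapping(FEATURES))
print("ITR-to-ITR span:", minigene_span, "nt")
```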

The resulting complementing hybrid vector,
pAd.AV.CMVLacZ (see Fig. 1A and Fig. 2 [SEQ ID NO: 1]),
contained a single copy of recombinant AV.CMVLacZ flanked
by adenovirus coordinates 0-1 on one side and 9-16.1 on
the other. Plasmid DNA was linearized using a unique
NheI site immediately 5' to adenovirus map unit zero (0)
(resulting in the top diagram of Fig. 1B).

Both the adenovirus substrate and the
complementing vector DNAs were transfected to 293 cells
[ATCC CRL1573] using a standard calcium phosphate
transfection procedure [see, e.g., Sambrook et al, cited
above]. The end result of homologous recombination
involving sequences that map to adenovirus map units
9-16.1 is hybrid Ad.AV.CMVLacZ (see Fig. 1C) in which the
E1a and E1b coding regions from the dl7001 adenovirus
substrate are replaced with the AV.CMVLacZ from the
hybrid vector.

Twenty-four hours later, the transfection
cocktail was removed and the cells overlayed with 0.8%
agarose containing 1x BME and 2% fetal bovine serum
(FBS). Once viral plaques developed (typically 10-12
days post-transfection), plaques were initially screened
for E. coli β-galactosidase (LacZ) activity by overlaying
the infected monolayer with agarose supplemented with a
histochemical stain for LacZ, according to the procedure
described in J. Price et al, Proc. Natl. Acad. Sci., USA
(1987). Positive clones (identified by the
deposit of insoluble blue dye) were isolated, subjected
to three rounds of freeze (dry ice/ethanol) - thaw (37°C)
and an aliquot of the suspended plaque was used to infect
a fresh monolayer of 293 cells seeded on duplicate 60 mm
plates.

Twenty-four hours later the cells from one set
of plates were fixed and again stained for LacZ activity.
Cells from the duplicate plate were harvested, suspended
in 0.5 ml 10 mM Tris-Cl, pH 8.0, and lysed by performing a
series of three freeze (dry ice/ethanol)-thaw (37°C)
cycles. Cell debris was removed by centrifugation and an
aliquot of the supernatant used to measure LacZ enzyme
activity.

As indicated in Fig. 3, assays for
β-galactosidase activity, which measured the absorbance at
420 nm of the beta-galactosidase blue color in successful
recombinants, revealed that eight of the ten isolated,
putative positive clones (D1A through D1J) expressed high
levels of enzyme. Histochemical staining produced
similar results.

Large-scale production and purification of
recombinant virus was performed as described in Kozarsky
et al, cited above, and references cited therein.

Example 2 - Functional Analysis of Hybrid Vector

The ability to rescue the AV.CMVLacZ sequence
[SEQ ID NO: 1] from the hybrid virus represented an
important feature of the hybrid vector and virus systems
of Example 1. To evaluate this feature, it was necessary
to provide the necessary AAV gene products in trans that
direct AAV excision and amplification (i.e., rep
proteins). Furthermore, this experiment was conducted in
293 cells to transcomplement the E1 deletion in the
Ad.AV.CMVLacZ clones, because the adenovirus E1 gene
proteins have been shown to be important for initiating
the lytic phase of the AAV lifecycle.

293 cells were seeded onto 6-well 35 mm plates
at a density of 1 x 10^6 cells/well. Twenty-four hours
later, seeding media [DMEM/10% FBS supplemented with
antibiotics] was replaced with 1.0 ml DMEM/2% FBS and
the cells were infected with Ad.AV.CMVLacZ hybrid clones
at an MOI of 1. Two hours later, each well was transfected
with 1 µg plasmid pRep78/52 [SEQ ID NO: 2], a transacting
plasmid that encodes the AAV rep 78 kD and
52 kD proteins. The rep sequences in this construct are
under the control of the AAV P5 promoter and utilize an
SV40 polyadenylation signal.
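
As a worked illustration of the multiplicity of infection (MOI) used here, the number of infectious units needed per well is simply the cell number multiplied by the MOI. In the sketch below, the function names and the stock titer of 1 x 10^10 pfu/ml are hypothetical values chosen for illustration; only the cell density and the MOI values (1 for the hybrid clones, 5 for the helper virus in this Example) come from the text.

```python
def pfu_required(cells_per_well, moi):
    """Infectious units needed to infect a well at the given MOI."""
    return cells_per_well * moi


def inoculum_volume_ul(pfu, stock_titer_pfu_per_ml):
    """Volume of virus stock (in microliters) that contains `pfu` units."""
    return pfu / stock_titer_pfu_per_ml * 1000.0


CELLS_PER_WELL = 1_000_000      # 1 x 10^6 cells/well, as stated above
HYPOTHETICAL_TITER = 1e10       # pfu/ml; assumed for illustration only

hybrid_pfu = pfu_required(CELLS_PER_WELL, 1)   # hybrid clones at MOI 1
helper_pfu = pfu_required(CELLS_PER_WELL, 5)   # helper virus at MOI 5

print("hybrid virus per well:", hybrid_pfu, "pfu")
print("helper virus per well:", helper_pfu, "pfu")
print("hybrid stock volume: %.2f ul" % inoculum_volume_ul(hybrid_pfu, HYPOTHETICAL_TITER))
```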

As a positive control for AAV rescue, 293 cells
seeded in a 6-well plate as above were co-transfected
with a cis-acting AAV plasmid pAV.CMVLacZ and pRep78/52.
pAV.CMVLacZ contained AV.CMVLacZ, the identical sequence
encoded by pAd.AV.CMVLacZ [SEQ ID NO: 1] described in
Example 1, cloned into the BglII site of pSP72 (Promega).

To provide the necessary adenovirus helper
function for AAV rescue, cells were infected with either
wild-type Ad5 virus or a first generation E1-deleted
virus Ad.CMVhpAP at an MOI of 5, approximately 2 hours
prior to adding the transfection cocktail. Ad.CMVhpAP is
identical to Ad.CMVLacZ (Example 1) with the modification
that the alkaline phosphatase sequence (which can be
obtained from Genbank) is inserted in place of the LacZ
gene.

Transfections were performed with Lipofectamine
(Life Technologies) according to the instructions
provided by the manufacturer. Thirty hours
post-transfection, the cells were harvested and episomal DNA
(Hirt extract) prepared as described by J. M. Wilson et
al, J. Biol. Chem., 267:(16):11483-11489 (1992). Samples
were resolved on a 1.2% agarose gel and electroblotted
onto a nylon membrane. Blots were hybridized (Southern)
with a 32P random primer-labeled restriction fragment
isolated from the E. coli LacZ cDNA.

The full spectrum of duplex molecular species
that appear during a lytic AAV infection (i.e., monomeric
and dimeric forms of the double stranded intermediates,
RFm and RFd, respectively) was evident in transfected
cells infected with wild type and E1-deleted Ad5. No
replicative intermediates were detected when transfections
were performed in the absence of helper virus.

Hirt extracts from the 293 cells infected with
putative Ad.AV.CMVLacZ hybrid clones D1A and D1C revealed
a single band corresponding to the viral DNA, when probed
with a LacZ restriction fragment. In the presence of rep
proteins 78 and 52, however, the same clones yielded a
banding pattern that included not only the adenovirus
DNA, but an RF monomer and dimer of AV.CMVLacZ. A
single-stranded form of AV.CMVLacZ [SEQ ID NO: 1] was not
evident. Two additional clones gave similar banding
patterns, D1B and D1H. In all, each of the eight
Ad.AV.CMVLacZ hybrids that were found in Fig. 3 to
express high levels of LacZ activity were positive for
rescue of the AAV domain.

With the exception of an extra band of
approximately 3.5 kb, the rescue of the AV.CMVLacZ [SEQ
ID NO: 1] from the hybrid viral DNA was nearly identical
to results obtained from a standard cis and trans
plasmid-based approach. In these latter samples,
adenovirus helper function was provided by pre-infecting
cells with either wild-type Ad5 or an E1-deleted
recombinant virus Ad.CBhpAP (also termed H5.CBALP). The
Ad.CBhpAP virus has the same sequence as the Ad.CMVhpAP
virus described above, except that the CMV promoter
sequence is replaced by the chicken cytoplasmic β-actin
promoter [nucleotides +1 to +275 as described in T. A.
Kost et al, Nucl. Acids Res., 11(23):8287 (1983)]. The
level of rescue in cells infected with WT Ad5 appeared to
be greater relative to those infected with the
recombinant Ad.CBhpAP virus, likely due to the additional
E1 expression provided by the wild-type genome. The
relevance of including an E1-deleted adenovirus here is
to document that the level of adenovirus E1 proteins
expressed in 293 cells is sufficient for AAV helper
function.
Example 3 - Synthesis of Polylysine Conjugates

Another version of the viral particle of this
invention is a polylysine conjugate with a rep plasmid
complexed directly to the hybrid virus capsid. This
conjugate permits efficient delivery of the rep
expression plasmid pRep78/52 [SEQ ID NO: 2] in tandem
with the hybrid virus, thereby removing the need for a
separate transfection step. See Fig. 8 for a
diagrammatic outline of this construction.

Purified stocks of a large-scale expansion of
Ad.AV.CMVLacZ clone D1A were modified by coupling
poly-L-lysine to the virion capsid essentially as described by
K. J. Fisher and J. M. Wilson, Biochem. J., 299:49-58
(1994), resulting in an Ad.AV.CMVLacZ-(Lys)n conjugate.
The procedure involves three steps. First, hybrid
virions are activated through primary amines on capsid
proteins with the heterobifunctional water-soluble
cross-linking agent, sulpho-SMCC [sulpho-(N-succinimidyl
4-(N-maleimidomethyl)-cyclohexane-1-carboxylate] (Pierce).
The conjugation reaction, which contained 0.5 mg (375
nmol) of sulpho-SMCC and 6 x 10^12 A260 hybrid vector
particles in 3.0 ml of HBS, was incubated at 30°C for 45
minutes with constant gentle shaking. This step involved
formation of a peptide bond between the active
N-hydroxysuccinimide (NHS) ester of sulpho-SMCC and a free
amine (e.g., lysine) contributed by an adenovirus protein
sequence (capsid protein) in the recombinant virus,
yielding a maleimide-activated viral particle.

Unincorporated, unreacted cross-linker was
removed by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG
(Bio-Rad Laboratories) column equilibrated with 50 mM
Tris/HCl buffer, pH 7.0, and 150 mM NaCl. Peak A260
fractions containing maleimide-activated hybrid virus
were combined and placed on ice.

Second, poly-L-lysine having a molecular mass
of 58 kDa at 10 mg/ml in 50 mM triethanolamine buffer (pH
8.0), 150 mM NaCl and 1 mM EDTA was thiolated with 2-
iminothiolane/HCl (Traut's Reagent; Pierce) to a molar
ratio of 2 moles -SH/mole polylysine under N2; the cyclic
thioimidate reacts with the poly(L-lysine) primary amines
resulting in a thiolated polycation. After a 45 minute
incubation at room temperature the reaction was applied
to a 1 cm x 15 cm Bio-Gel P6DG column equilibrated with
50 mM Tris/HCl buffer (pH 7.0), 150 mM NaCl and 2 mM EDTA
to remove unincorporated Traut's Reagent.

Quantification of free thiol groups was
accomplished with Ellman's reagent [5,5'-dithio-bis-(2-
nitrobenzoic acid)], revealing approximately 2 mol of
-SH/mol of poly(L-lysine). Third, the coupling reaction
was initiated by adding 1 x 10^12 A260 particles of
maleimide-activated hybrid virus/mg of thiolated
poly(L-lysine) and incubating the mixture on ice at 4°C
for 15 hours under argon. 2-Mercaptoethylamine was added
at the completion of the reaction and incubation carried
out at room temperature for 20 minutes to block unreacted
maleimide sites.
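
As a rough illustration of how a ratio of about 2 mol -SH per mol
poly(L-lysine) can be estimated from an Ellman's assay, the sketch
below applies the Beer-Lambert law. The extinction coefficient
(14,150 M^-1 cm^-1 at 412 nm, a commonly cited value for the TNB
anion), the 1 cm path length, and the example absorbance and
concentration are assumptions chosen for illustration; none of them
is reported in this specification.

```python
# Illustrative only: estimate mol -SH per mol poly(L-lysine) from an
# Ellman's assay reading. All numeric inputs below are assumed values.

TNB_EXTINCTION = 14150.0   # M^-1 cm^-1 at 412 nm (commonly cited; assumed)
POLYLYSINE_MW = 58000.0    # g/mol, from the 58 kDa poly(L-lysine) above


def thiols_per_polylysine(a412, polylysine_mg_per_ml, path_cm=1.0):
    """Return the molar ratio of free thiol to poly(L-lysine)."""
    thiol_molar = a412 / (TNB_EXTINCTION * path_cm)          # Beer-Lambert
    polylysine_molar = polylysine_mg_per_ml / POLYLYSINE_MW  # mg/ml == g/l
    return thiol_molar / polylysine_molar


# Hypothetical blank-corrected reading: A412 = 0.0488 with
# 0.1 mg/ml poly(L-lysine) in the cuvette.
ratio = thiols_per_polylysine(0.0488, 0.1)
print(f"{ratio:.2f} mol -SH per mol poly(L-lysine)")
```

The function assumes the absorbance has already been corrected for a
reagent blank and that the polylysine concentration refers to the
solution in the cuvette, after any dilution.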

Virus-polylysine conjugates, Ad.AV.CMVLacZ-(Lys)n,
were purified away from unconjugated poly(L-lysine) by
ultracentrifugation through a CsCl step gradient with an
initial composition of equal volumes of 1.45 g/ml (bottom
step) and 1.2 g/ml (top step) CsCl in 10 mM Tris/HCl
buffer (pH 8.0). Centrifugation was at 90,000 g for 2
hours at 5°C. The final product was dialyzed against
20 mM Hepes buffer (pH 7.8) containing 150 mM NaCl (HBS).

Complexes of Ad.AV.CMVLacZ-(Lys)n with
pRep78/52 plasmid DNA [SEQ ID NO: 2] were formed by
adding varying quantities of Ad.AV.CMVLacZ-(Lys)n in 50 µl
HBS to 0.5 µg of pRep78/52 plasmid DNA [SEQ ID NO: 2] in
50 µl HBS. After 30 minutes incubation at room
temperature, a complex was formed of the hybrid virus
Ad.AV.CMVLacZ-(Lys)n associated in a single particle with
the plasmid DNA containing the rep genes.

This complex, termed a trans-infection
particle, was evaluated for DNA binding capacity by gel
mobility shift assays performed as described in Fisher et
al, cited above. This analysis revealed that the plasmid
binding capacity of the purified conjugate (expressed as
the number of A260 particles Ad.AV.CMVLacZ-(Lys)n that
can neutralize the charge contributed by 1 µg plasmid
DNA) was 1 µg pRep78/52 plasmid DNA/6.0 x 10^10 A260
particles Ad.AV.CMVLacZ-(Lys)n.
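
The binding capacity can be used to scale the amount of conjugate to
a given mass of plasmid. The sketch below is a minimal illustration,
assuming the capacity is expressed per microgram of plasmid DNA and
that it scales linearly; the function and constant names are
hypothetical.

```python
# Illustrative only: scale conjugate to plasmid using a binding
# capacity of 6.0 x 10^10 A260 particles per microgram of plasmid DNA.

PARTICLES_PER_UG_DNA = 6.0e10  # from the gel mobility shift result above


def particles_for_plasmid(plasmid_ug):
    """A260 particles of conjugate needed to neutralize plasmid_ug of DNA."""
    if plasmid_ug < 0:
        raise ValueError("plasmid mass must be non-negative")
    return plasmid_ug * PARTICLES_PER_UG_DNA


def plasmid_for_particles(particles):
    """Micrograms of plasmid DNA that a given number of particles can bind."""
    if particles < 0:
        raise ValueError("particle count must be non-negative")
    return particles / PARTICLES_PER_UG_DNA


for ug in (0.5, 1.0, 750.0):
    print(f"{ug:g} ug plasmid -> {particles_for_plasmid(ug):.2e} particles")
```

Under this assumption, 750 µg of plasmid corresponds to 4.5 x 10^13
particles, which matches the amounts combined in the large-scale
trans-infection of Example 8.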


Example 4 - Trans-Infection Protocol to Demonstrate AAV 
Excision and Amplification 

Trans-infection complexes were prepared by
mixing Ad.AV.CMVLacZ-(Lys)n conjugate with pRep78/52
plasmid [SEQ ID NO: 2] and applied to 293 cells as
follows. Ad.AV.CMVLacZ-(Lys)n (6 x 10^10 A260 particles) in
100 µl DMEM was added dropwise to a microfuge tube
containing 1 µg plasmid DNA in 100 µl DMEM. The mixture
was gently mixed and allowed to incubate at room
temperature for 10-15 minutes. The trans-infection
cocktail was added to 293 cells seeded in a 35 mm 6-well
plate as detailed above. Thirty hours later, cells were
harvested and Hirt extracts prepared.




Samples were resolved on a 1.2% agarose gel and
electroblotted onto a nylon membrane. Blots were
hybridized (Southern) with a P-32 random primer-labeled
restriction fragment isolated from the E. coli LacZ cDNA.

The Hirt extracts from 293 cells revealed a
banding pattern that suggested the AV.CMVLacZ minigene
sequence [SEQ ID NO: 1] was efficiently rescued from the
hybrid conjugate. Both an RF monomer and dimer of the
recombinant AV.CMVLacZ sequence were evident. As was
observed previously, the rescue event was dependent on
rep proteins since 293 cells that were trans-infected
with a hybrid conjugate complexed with an irrelevant
reporter plasmid expressing alkaline phosphatase (i.e.
pCMVhpAP) revealed only Ad.AV.CMVLacZ DNA. This negative
control for rescue was secondarily useful for
demonstrating the high efficiency of gene transfer to 293
cells that was achieved with the conjugate vehicle.

A duplicate set of 293 cells that received
hybrid conjugate which was further complexed with
alkaline phosphatase expression plasmid were fixed 24
hours after addition of the trans-infection cocktail and
histochemically stained for LacZ as described in Price et
al, cited above, or for alkaline phosphatase activity as
described in J. H. Schreiber et al, BioTechniques,
14:818-823 (1993). Here LacZ was a marker for the
Ad.AV.CMVLacZ hybrid, while alkaline phosphatase served
as a reporter for the carrier plasmid. Greater than 90%
of the monolayer was transduced with both β-galactosidase
and alkaline phosphatase transgenes, showing the high
efficiency of the conjugate delivery vehicle
(differential staining revealed a blue color for the
hybrids containing the LacZ marker and a purple color for
the plasmids bearing the AP marker).

Because of the important role E1 proteins have
for progression of the AAV lifecycle, it was critical to
test the efficiency of the hybrid delivery system in a
setting where E1 proteins are not expressed. A
trans-infection experiment using the hybrid conjugate
complexed with pRep78/52 [SEQ ID NO: 2] was therefore
conducted in HeLa cells [ATCC CCL2] to remove the
involvement of E1 proteins. The findings suggested
rescue of AV.CMVLacZ occurred, as evidenced by the
accumulation of RF monomers and dimers. Rescue from HeLa
cells (which unlike the 293 cells do not contain any
adenovirus E1 proteins) revealed lower levels of rescue
of the transgene. The expression of rep from the AAV P5
promoter is upregulated by adenovirus E1 and signals the
beginning of the lytic cycle. In the absence of E1,
rep expression from the P5 promoter is virtually silent,
which is important for maintenance of the proviral latent
stages of the AAV lifecycle. It is anticipated that a
promoter not dependent on E1 expression will, upon
substitution for P5, overcome this problem.

Example 5 - Integration of the Transgene

A preliminary study has been performed to
determine whether the sequence rescued from the
hybrid virus can achieve provirus status in a target cell
(Fig. 7). Briefly, HeLa cells [ATCC CCL 2] were infected
with the hybrid conjugate complexed with pRep78/52 [SEQ
ID NO: 2] and passaged until stable colonies of LacZ
expressing cells were evident. A duplicate plate of
cells was infected with the same conjugate, but instead
of being complexed with the pRep78/52 plasmid [SEQ ID NO:
2], carried an irrelevant plasmid. These findings
indicated that cells that received the Rep containing
hybrid particle produced a greater number of stable LacZ
positive colonies than cells that were infected with the
control virus. This could be interpreted as a reflection
of multiple rescue and integration events in cells that
expressed Rep proteins. However, it is possible that an
episomal form of AAV that can persist for extended
periods of time was present.

To establish the occurrence of integration into
the chromosome of the minigene AV.CMVLacZ from the hybrid
conjugate, the following experiment is performed. The
Ad.AV.CMVLacZ-(Lys)n conjugate carrying pRep78/52 plasmid
[SEQ ID NO: 2] is used to infect HeLa cells [ATCC CCL2]
(primary fibroblasts may also be used). The infected
cells are passaged for several generations. The cells
are grown to confluency, split and allowed to grow to
confluency again, split again and this cycle repeated as
desired. This permits sufficient time for uptake,
expression, replication and integration to occur. See
Fig. 7.

To verify that the recombinant AAV sequence
that was rescued from the hybrid genome (step III of Fig.
7) has integrated into a chromosome of the host cell
(step IV of Fig. 7), cells are separated by a
Fluorescence Activated Cell Sorter (FACS). By this
technique, those cells containing a stable integrated
copy of the recombinant AV.CMVLacZ minigene are separated
based on the presence of the β-galactosidase reporter.
These cells are tagged with fluorescein-labeled
antibodies that recognize the β-Gal protein, and are then
separated from non-transduced cells (i.e. those that did
not receive a copy of the AAV minigene) by FACS.

DNA is isolated from this purified population
of cells and used to construct a genomic library which is
screened for individual clones and the sequence verified.
If integration occurs, it is documented directly by
sequence analysis.

Example 6 - Gene Transfer Vehicle for Cystic Fibrosis

An adenovirus-AAV-CFTR virus is constructed by
modifying the hybrid Ad.AV.CMVLacZ virus described in
Example 1 to contain the cystic fibrosis transmembrane
regulator (CFTR) gene [J.R. Riordan et al, Science,
245:1066-1073 (1989)] in place of the lacZ gene, using
known techniques. One suitable method involves producing
a new vector using the techniques described in Example 1.
In this new vector the LacZ minigene is replaced with the
CFTR minigene. For performance of this method vectors
bearing the CFTR gene have been previously described and
can be readily constructed. This new or reconstructed
vector is used to generate a new virus through homologous
recombination as described above. The resulting hybrid
virus is termed hybrid Ad.AV.CMVCFTR. It has the
sequence of Fig. 2 [SEQ ID NO: 1], except that the LacZ
gene is replaced with CFTR. Alternatively, the LacZ gene
can be removed from the Ad.AV.CMVLacZ vector of Example 1
and replaced with the CFTR gene using known techniques.

This virus (or an analogous hybrid virus with a
different promoter, regulatory regions, etc.) is useful
in gene therapy alone, or preferably, in the form of a
conjugate prepared as described in Example 4.

Treatment of cystic fibrosis, utilizing the
viruses provided above, is particularly suited for in
vivo, lung-directed, gene therapy. Airway epithelial
cells are the most desirable targets for gene transfer
because the pulmonary complications of CF are usually its
most morbid and life-limiting. Thus, the hybrid vector
of the invention, containing the CFTR gene, is delivered
directly into the airway, e.g. by formulating the hybrid
virus above into a preparation which can be inhaled.
For example, the hybrid virus or conjugate of the
invention containing the CFTR gene is suspended in 0.25
molar sodium chloride. The virus or conjugate is taken
up by respiratory airway cells and the gene is expressed.

Alternatively, the hybrid viruses or conjugates
of the invention may be delivered by other suitable
means, including site-directed injection of the virus
bearing the CFTR gene. In the case of CFTR gene
delivery, preferred solutions for bronchial instillation
are sterile saline solutions containing in the range of
from about 1 x 10^7 to 1 x 10^10 pfu/ml, more particularly,
in the range of from about 1 x 10^8 to 1 x 10^9 pfu/ml of
the recombinant hybrid virus of the present invention.

Other suitable methods for the treatment of
cystic fibrosis by use of gene therapy recombinant
viruses of this invention may be obtained from the art
discussions of other types of gene therapy vehicles for
CF. See, for example, U.S. Patent No. 5,240,846,
incorporated by reference herein.

Example 7 - Gene Transfer Vehicle for Familial
Hypercholesterolemia

Familial hypercholesterolemia (FH) is an
autosomal dominant disorder caused by abnormalities
(deficiencies) in the function or expression of LDL
receptors [M.S. Brown and J.L. Goldstein, Science,
232(4746):34-37 (1986); J.L. Goldstein and M.S. Brown,
"Familial hypercholesterolemia" in Metabolic Basis of
Inherited Disease, ed. C.R. Scriver et al, McGraw Hill,
New York, pp 1215-1250 (1989)]. Patients who inherit one
abnormal allele have moderate elevations in plasma LDL
and suffer premature life-threatening coronary artery
disease (CAD). Homozygous patients have severe
hypercholesterolemia and life-threatening CAD in
childhood.

A hybrid adenovirus-AAV-LDL virus of the
invention is constructed by replacing the lacZ gene in
the hybrid Ad.AV.CMVlacZ virus of Example 1 with an LDL
receptor gene [T. Yamamoto et al, Cell, 39:27-38 (1984)]
using known techniques and as described analogously for
CF in the preceding example. Vectors bearing the LDL
receptor gene can be readily constructed according to
this invention. The resulting hybrid vector is termed
pAd.AV.CMVLDL.

This plasmid or its recombinant virus is useful
in gene therapy of FH alone, or preferably, in the form
of a viral conjugate prepared as described in Example 4
to substitute a normal LDL gene for the abnormal allele
responsible for the disease.

A. Ex Vivo Gene Therapy

Ex vivo gene therapy can be performed by
harvesting and establishing a primary culture of
hepatocytes from a patient. Known techniques may be used
to isolate and transduce the hepatocytes with the above
vector(s) bearing the LDL receptor gene(s). For example,
techniques of collagenase perfusion developed for rabbit
liver can be adapted for human tissue and used in
transduction. Following transduction, the hepatocytes
are removed from the tissue culture plates and reinfused
into the patient using known techniques, e.g. via a
catheter placed into the inferior mesenteric vein.

B. In Vivo Gene Therapy

Desirably, the in vivo approach to gene
therapy, e.g. liver-directed, involves the use of the
hybrid viruses and viral conjugates described above. A
preferred treatment involves infusing a trans-infection
particle of the invention containing LDL into the
peripheral circulation of the patient. The patient is
then evaluated for change in serum lipids and liver
tissues.

The hybrid virus or viral conjugate can be
used to infect hepatocytes in vivo by direct injection
into a peripheral or portal vein (10^7-10^8 pfu/kg) or
retrograde into the biliary tract (same dose). This
effects gene transfer into the majority of hepatocytes.

Treatments are repeated as necessary, e.g.
weekly. Administration of a dose of virus equivalent to
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is
anticipated to lead to high level gene expression in the
majority of hepatocytes.
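
The relationship between a target MOI and a total viral dose is
simple multiplication, sketched below. The hepatocyte count and body
mass in the example are hypothetical placeholders chosen only to show
the arithmetic; they are not values given in this specification and
are not dosing guidance.

```python
# Illustrative only: convert a target multiplicity of infection (MOI)
# into a total viral dose. Cell count and body mass are hypothetical.


def total_pfu_for_moi(moi, target_cells):
    """Plaque-forming units needed to reach `moi` pfu per target cell."""
    if moi < 0 or target_cells < 0:
        raise ValueError("inputs must be non-negative")
    return moi * target_cells


def pfu_per_kg(total_pfu, body_mass_kg):
    """Express a total dose per kilogram of body mass."""
    if body_mass_kg <= 0:
        raise ValueError("body mass must be positive")
    return total_pfu / body_mass_kg


# Hypothetical example: 1e8 target hepatocytes at an MOI of 20
dose = total_pfu_for_moi(20, 1e8)
print(f"total dose: {dose:.1e} pfu")
print(f"per kg for a hypothetical 20 kg subject: {pfu_per_kg(dose, 20):.1e} pfu/kg")
```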

Example 8 - Efficient Production of Recombinant AAV using
a Hybrid Virus/Conjugate

The following experiment demonstrated that the
AAV genome that was rescued from the Ad.AV.CMVLacZ hybrid
virus could be packaged into an AAV capsid, provided the
cap open reading frame was supplied in trans. Thus the
viruses of this invention are useful in a production
method for recombinant AAV which overcomes the prior art
complications that surround the high titer production of
recombinant AAV.

A. Trans-Infection Protocol for the Production of rAAV

A trans-infection complex was formed
composed of the Ad.AV.CMVLacZ-(Lys)n conjugate described
above and a transcomplementing plasmid pAdAAV, which is
described in detail in [R. J. Samulski et al, J. Virol.,
63(9):3822-3828 (1989)]. Briefly, plasmid pAdAAV encodes
the entire rep and cap open reading frames in the absence
of AAV ITRs, and has been shown to provide the necessary
AAV helper functions for replication and packaging of
recombinant AAV sequences.

Ad.AV.CMVLacZ-(Lys)n conjugate (4.5 x 10^13
A260 particles) in 75 ml DMEM was added dropwise with
constant gentle swirling to 25 ml DMEM containing 750 µg
pAdAAV helper plasmid and incubated at room temperature
for 10-15 minutes. The complex was diluted with 450 ml
DMEM supplemented with 2% FBS and 20 ml aliquots were
added to monolayers of 293 cells seeded on 150 mm plates.
Forty hours post trans-infection, cells were
harvested, suspended in 12 ml 10 mM Tris-Cl (pH 8.0), and
stored at -80°C.

Because the anticipated outcome was the
production of hybrid virus Ad.AV.CMVLacZ and a
recombinant AAV virion (AV.CMVLacZ), both of which carry
a functional LacZ minigene, it was not possible to use
detection of LacZ activity as an indicator of AV.CMVLacZ
production. A novel molecular approach was developed
that could be performed in one day and permitted
identification of the packaged viral DNAs.

B. Purification of rAAV

Briefly, frozen cell suspensions were
subjected to three rounds of freeze-thaw cycles to
release recombinant AV.CMVLacZ and hybrid Ad.AV.CMVLacZ.
On completion of the final thaw, bovine pancreatic DNase
(2000 units) and ribonuclease (0.2 mg/ml final
concentration) were added and the extract incubated at
37°C for 30 minutes. Cell debris was removed by
centrifugation (5000 x g for 10 minutes) and the clarified
supernatant (15 ml) applied to a 22.5 ml step gradient
composed of equal volumes of CsCl at 1.2 g/ml, 1.36 g/ml,
and 1.45 g/ml in 10 mM Tris-Cl, pH 8.0. Viral particles
were banded at 25,000 rpm in a Beckman SW-28 rotor for 8
hours at 4°C. One ml fractions were collected from the
bottom of the tube.
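
The volumes implied by this gradient can be worked out directly. The
sketch below assumes the gradient is divided into three equal tiers,
one per CsCl density, and that the full 15 ml of clarified
supernatant is layered onto a single gradient; the function names are
hypothetical.

```python
# Illustrative only: volumes for a CsCl step gradient built from equal
# tiers, using the 22.5 ml gradient and 15 ml sample described above.


def tier_volume_ml(gradient_ml, n_tiers):
    """Volume of each tier when the gradient is split into equal steps."""
    if n_tiers <= 0:
        raise ValueError("need at least one tier")
    return gradient_ml / n_tiers


def total_tube_volume_ml(gradient_ml, sample_ml):
    """Gradient plus overlaid sample; this must fit the rotor tube."""
    return gradient_ml + sample_ml


per_tier = tier_volume_ml(22.5, 3)          # three densities, equal volumes
loaded = total_tube_volume_ml(22.5, 15.0)   # gradient + clarified supernatant
print(f"{per_tier} ml per tier, {loaded} ml loaded per tube")
```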

The fractions retrieved from the CsCl
gradient of partially purified virus are then digested to
release viral DNA from virion capsids as follows. A
5.0 µl sample of each fraction was transferred to a
microfuge tube containing 20 µl capsid digestion buffer
(50 mM Tris-Cl, pH 8.0, 1.0 mM EDTA, pH 8.0, 0.5% SDS, and
1.0 mg/ml Proteinase K). The reaction was incubated at
50°C for 1 hour, allowed to cool to room temperature,
diluted with 10 µl milli-Q water, and agarose gel loading
dye added.
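
For reference, the working concentrations during the digestion follow
from diluting the buffer with the sample. The sketch below is a
minimal illustration that treats the gradient fraction as
contributing volume only, ignoring the Tris and CsCl it carries; that
simplification and the names used are assumptions.

```python
# Illustrative only: final concentrations in the capsid digestion after
# 20 ul of buffer is mixed with a 5 ul gradient fraction.

BUFFER_STOCK = {               # as listed for the capsid digestion buffer
    "Tris-Cl (mM)": 50.0,
    "EDTA (mM)": 1.0,
    "SDS (%)": 0.5,
    "Proteinase K (mg/ml)": 1.0,
}


def dilute(stock, buffer_ul, sample_ul):
    """Scale each stock concentration by buffer volume / total volume."""
    factor = buffer_ul / (buffer_ul + sample_ul)
    return {name: conc * factor for name, conc in stock.items()}


final = dilute(BUFFER_STOCK, buffer_ul=20.0, sample_ul=5.0)
for name, conc in final.items():
    print(f"{name}: {conc:g}")
```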

These fractions are then analyzed by
Southern blotting. Samples were resolved on a 1.2%
agarose gel and electroblotted onto a nylon membrane. A
32P-labeled LacZ restriction fragment which was common to
both vectors was used as a hybridization probe to locate
the migration of viral DNA through the agarose gel.
Viral bands were quantitated on a Molecular Dynamics
PhosphorImager.

A sample of the extract before CsCl
banding was also tested and revealed both hybrid
Ad.AV.CMVLacZ DNA and double-stranded RF forms (monomers
and dimers) of the rescued AV.CMVLacZ sequence [SEQ ID
NO: 1]. A single-stranded monomer of AV.CMVLacZ appeared
to be present in the crude extract; however, it was not
until the virions were concentrated by buoyant density
ultracentrifugation that the single-stranded genome
became clearly evident. The single-stranded recombinant
genome of the virus was distributed over a range of CsCl
densities and revealed a biphasic banding pattern. The
two peaks of single-stranded rAAV genome occurred at
densities of 1.41 and 1.45 g/ml CsCl, consistent with the
reported buoyant densities of wild-type AAV in CsCl [L.
M. de la Maza et al, J. Virol., 33:1129-1137 (1980)].
Analysis of the fractions corresponding to the two vector
forms revealed the rAAV-1.41 species was several orders
of magnitude more active for lacZ transduction than the
denser rAAV-1.45 g/ml variant. To avoid confusion with
contaminating Ad.AAV, samples were heat inactivated (60°C
for 30 min) before being added to indicator HeLa cells.

The peak fractions of rAAV-1.41 were
combined and purified by equilibrium sedimentation in
CsCl to eliminate residual adenovirus particles and
concentrate rAAV virions. On the final round of
ultracentrifugation, a faint but clearly visible
opalescent band was observed in the middle of the
gradient tube. Fractions that surrounded the band were
evaluated for density, absorbance at 260 nm, and lacZ
transducing particles. As the band eluted from the
gradient tube, a well defined peak of 260 nm absorbing
material was recorded, with a maximal absorbance
occurring at a density of 1.40 g/ml CsCl. Analysis for
lacZ transducing particles on HeLa cells revealed a peak
of activity that mirrored the absorbance profile. These
results indicate rAAV was produced from the hybrid Ad.AAV
virus. Furthermore, the titers achieved using the hybrid
virus were 5-10 fold elevated compared to more
conventional recombinant AAV production schemes (i.e.,
transfections with cis- and trans-acting plasmids). This
represents a significant improvement in rAAV production
and indicates that the hybrid is useful for large-scale
rAAV production.

All references recited above are incorporated
herein by reference. Numerous modifications and
variations of the present invention are included in the
above-identified specification and are expected to be
obvious to one of skill in the art. Such modifications
and alterations to the compositions and processes of the
present invention, such as those modifications permitting
optimal use of the hybrid viruses as gene therapy
vehicles or production vehicles for recombinant AAV
production, are believed to be encompassed in the scope
of the claims appended hereto.

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10398 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



GAATTCGCTA GCATCATCAA TAATATACCT TATTTTGGAT TGAAGCCAAT  50
ATGATAATGA GGGGGTGGAG TTTGTGACGT GGCGCGGGGC GTGGGAACGG  100
GGCGGGTGAC GTAGTAGTGT GGCGGAAGTG TGATGTTGCA AGTGTGGCGG  150
AACACATGTA AGCGACGGAT GTGGCAAAAG TGACGTTTTT GGTGTGCGCC  200
GGTGTACACA GGAAGTGACA ATTTTCGCGC GGTTTTAGGC GGATGTTGTA  250
GTAAATTTGG GCGTAACCGA GTAAGATTTG GCCATTTTCG CGGGAAAACT  300
GAATAAGAGG AAGTGAAATC TGAATAATTT TGTGTTACTC ATAGCGCGTA  350
ATATTTGTCT AGGGAGATCT GCTGCGCGCT CGCTCGCTCA CTGAGGCCGC  400
[illegible] [illegible] [illegible] TGGTCGCCCG GCCTCAGTGA  450
GCGAGCGAGC GCGCAGAGAG GGAGTGGCCA ACTCCATCAC TAGGGGTTCC  500
TTGTAGTTAA TGATTAACCC GCCATGCTAC TTATCTACAA TTCGAGCTTG  550
CATGCCTGCA GGTCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA  600
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT  650
AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC  700
GGTAAACTGC CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTACG  750
CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA  800
GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG  850
TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT  900
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT  950
CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC  1000
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG  1050


GAGGTCTATA TAAGGAGAGC TCGTTTAGTG AACCGTCAGA TCGCCTGGAG  1100
ACGCCATCCA CGCTGTTTTG ACCTCCATAG AAGACACCGG GACCGATCCA  1150
GCCTCCGGAC TCTAGAGGAT CCGGTACTCG AGGAACTGAA AAACCAGAAA  1200
GTTAACTGGT AAGTTTAGTC TTTTTGTCTT TTATTTCAGG TCCCGGATCC  1250
GGTGGTGGTG CAAATCAAAG AACTGCTCCT CAGTGGATGT TGCCTTTACT  1300
TCTAGGCCTG TACGGAAGTG TTACTTCTGC TCTAAAAGCT GCGGAATTGT  1350
ACCCGCGGCC GCAATTCCCG GGGATCGAAA GAGCCTGCTA AAGCAAAAAA  1400
GAAGTCACCA TGTCGTTTAC TTTGACCAAC AAGAACGTGA TTTTCGTTGC  1450
CGGTCTGGGA GGCATTGGTC TGGACACCAG CAAGGAGCTG CTCAAGCGCG  1500
ATCCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA  1550
CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA  1600
AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG  1650
AATGGCGCTT TGCCTGGTTT CCGGCACCAG AAGCGGTGCC GGAAAGCTGG  1700
CTGGAGTGCG ATCTTCCTGA GGCCGATACT GTCGTCGTCC CCTCAAACTG  1750
GCAGATGCAC GGTTACGATG CGCCCATCTA CACCAACGTA ACCTATCCCA  1800
TTACGGTCAA TCCGCCGTTT GTTCCCACGG AGAATCCGAC GGGTTGTTAC  1850
TCGCTCACAT TTAATGTTGA TGAAAGCTGG CTACAGGAAG GCCAGACGCG  1900
AATTATTTTT GATGGCGTTA ACTCGGCGTT TCATCTGTGG TGCAACGGGC  1950
GCTGGGTCGG TTACGGCCAG GACAGTCGTT TGCCGTCTGA ATTTGACCTG  2000
[illegible] [illegible] AGAAAACCGC CTCGCGGTGX TGGTGCTGCG  2050
TTGGAGTGAC GGCAGTTATC TGGAAGATCA GGATATGTGG CGGATGAGCG  2100
GCATTTTCCG TGACGTCTCG TTGCTGCATA AACCGACTAC ACAAATCAGC  2150
GATTTCCATG TTGCCACTCG CTTTAATGAT GATTTCAGCC GCGCTGTACT  2200
GGAGGCTGAA GTTCAGATGT GCGGCGAGTT GCGTGACTAC CTACGGGTAA  2250
CAGTTTCTTT ATGGCAGGGT GAAACGCAGG TCGCCAGCGG CACCGCGCCT  2300
TTCGGCGGTG AAATTATCGA TGAGCGTGGT GGTTATGCCG ATCGCGTCAC  2350
ACTACGTCTG AACGTCGAAA ACCCGAAACT GTGGAGCGCC GAAATCCCGA  2400
ATCTCTATCG TGCGGTGGTT GAACTGCACA CCGCCGACGG CACGCTGATT  2450
GAAGCAGAAG CCTGCGATGT CGGTTTCCGC GKGnJTGGGGK TTGAAAATGG  2500
TCTGCTGCTG CTGAACGGCA AGCCGTTGCT GATTCGAGGC GTTAACCGTC  2550
ACGAGCATCA TCCTCTGCAT GGTCAGGTCA TGGATGAGCA GACGATGGTG  2600
CAGGATATCC TGCTGATGAA GCAGAACAAC "SOTAACGCCG TGCGCTGTTC  2650
GCATTATCCG AACCATCCGC TGTGGTACAC GCTGTGCGAC CGCTACGGCC  2700
TGTATGTGGT GGATGAAGCC AATATTGAAA CCCACGGCAT GGTGCCAATG  2750
AATCGTCTGA CCGATGATCC GCGCTGGCTA CCGGCGATGA GCGAACGCGT  2800
AACGCGAATG GTGCAGCGCG ATCGTAATCA CCCGAGTGTG ATCATCTGGT  2850
CGCTGGGGAA TGAATCAGGC CACGGCGCTA ATCACGACGC GCTGTATCGC  2900
•EGGATCAAAT CTGTCGATCC TTCCCGCCCG GflGCAGTATG AAGGCGGCGG  2950
AGCCGACACC ACGGCCACCG ATATTAETTG CCCGATGTAC GCGCGCGTGG  3000
ATGAAGACCA GCCCTTCCCG GCTGTGCCGA AATGGTCCAT CAAAAAATGG  3050
CTTTCGCTAC CTGGAGAGAC GCGCCCGCTG ATCCTOTGCG AATACGCCCA  3100
CGCGATGGGT AACAGTCTTG GCGG^TCGC TAAATACTGG CAGGCGfTTC  3150
GTCAGTATCC CCGTTTACAG GGCGGCTTCG TCTGGGACTG GGTGGATCAG  3200
TCGCTGATTA AATATGATGA AAACGGCAAC CCGTGGTCGG CTTACGGCGG  3250
TGATTTTGGC GATACGCCGA ACGATCGCCA GTTCTGTATG AACGGTCTGG  3300
TCTTTGCCGA CCGCACGCCG CATCCAGCGC TGACGGAAGC AAAACACCAG  3350
CAGCAGTTTT TCCAGTTCCG TTTATCCGGG CAAACCATCG AAGTGACCAG  3400
CGAATACCTG TTCCGTCATA GCGATAACGA GCTCCTGCAC TGGATGGTGG  3450
CGCTGGATGG TAAGCCGCTG GCAAGCGGTG AAGTGCCTCT GGATGTCGCT  3500
CCACAAGGTA AACAGTTGAT TGAACTGCCT GAACTACCGC AGCCGGAGAG  3550
CGCCGGGCAA CTCTGGCTCA CAGTACGCGT AGTGCAACCG AACGCGACCG  3600
CATGGTCAGA AGCCGGGCAC ATCAGCGCCT GGCAGCAGTG GCGTCTGGCG  3650


GAAAACCTCA 


GTGTGACGCT 


CCCCGCCGCG 


TCCCACGCCA 


TCCCGCATCT 


3700 


GACCACCAGC 


GAAATGGATT 


TTTGCATCGA 


GCTGGGTAAT 


AAGCGTTGGC 


3750 


AATTTAACCG 


CCAGTCAGGC 


TTTCTTTCAC 


AGATcTGGAT 


TGGCGATAAA 


3800 


AAACAACTGC 


TGACGCCGCT 


GCGCGATCAG 


TTCACCCGTG 


CACCGCTGGA 


3850 


TAACGACATT 


GGCGTAAGTG 


AAGCGACCCG 


CATTGACCCT 


AACGCCTGGG 


3900 


TCGAACGCTG 


GAAGGCGGCG 


GGCCATTACC 


AGGCCGAAGC 


AGCGTTGTTG 


3950 


CAGTGCACGG 


CAGATACACT 


TGCTGATGCG 


GTGCTGATTA 


CGACCGCTCA 


4000 


CGCGTGGCAG 


CATCAGGGGA 


AAACCTTATT 


TATCAGCCGG 


AAAACCTACC 


4050 


GGATTGATGG 


TAGTGGTCAA 


ATGGCGATTA 


CCGTTGATGT 


TGAAGTGGCG 


4100 


AGCGATACAC 


CGCATCCGGC 


GCGGATTGGC 


CTGAACTGCC 


AGCTGGCGCA 


4150 


GGTAGCAGAG 


CGGGTAAACT 


GGCTCGGATT 


AGGGCCGCAA 


GAAAACTATC 


4200 


CCGACCGCCT 


TACTGCCGCC 


TGTTTTGACC 


GCTGGGATCT 


GCCATTGTCA 


4250 


GACATGTATA 


CCCCGTACGT 


CTTCCCGAGC 


GAAAACGGTC 


TGCGCTGCGG 


4300 


GACGCGCGAA 


TTGAATTATG 


GCCCACACCA 


GTGGCGCGGC 


GACTTCCAGT 


4350 


TCAACATCAG 


CCGCTACAGT 


CAACAGCAAC 


TGATGGAAAC 


CAGCCATCGC 


4400 


CATCTGCTGC 


ACGCGGAAGA 


AGGCACATGG 


CTGAATATCG 


ACGGTTTCCA 


4450 


TATGGGGATT 


GGTGGCGACG 


ACTCCTGGAG 


CCCGTCAGTA 


TCGGCGGAAT 


4500 


TACAGCTGAG 


CGCCGGTCGC 


TACCATTACC 


AGTTGGTCTG 


GTGTCAAAAA 


4550 


TAATAATAAC 


CGGGCAGGCC 


ATGTCTGCCC 


GTATTTCGCG 


TAAGGAAATC 


4600 


CATTATGTAC 

^^•k m± m\ mm m> m* mrn^* 


TATTTAAAAA 


ACACAAACTT 


TTGGATGTTC 


GGTTTATTCT 


4650 


TTTTCTTTTA 


CTTTTTTATC 


ATGGGAGCCT 


ACTTCCCGTT 


TTTCCCGATT 


4700 


TGGCTACATG 


ACATCAACCA 


TATCAGCAAA 


AGTGATACGG 


GTATTATTTT 


4750 


TGCCGCTATT 


TCTCTGTTCT 


CGCTATTATT 


CCAACCGCTG 


TTTGGTCTGC 


4800 


TTTCTGACAA 


ACTCGGCCTC 


GACTCTAGGC 


GGCCGCGGGG 


ATCCAGACAT 


4850 


GATAAGATAC 


ATTGATGAGT 


TTGGACAAAC 


CACAACTAGA 


ATGCAGTGAA 


4900 
AAAAATGCTT TATTTGTGAA ATTTGTGATG CTATTGCTTT ATTTGTAACC 4950
ATTATAAGCT GCAATAAACA AGTTAACAAC AACAATTGCA TTCATTTTAT 5000
GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT TTTTTCGGAT CCTCTAGAGT 5050
CGAGTAGATA AGTAGCATGG CGGGTTAATC ATTAACTACA AGGAACCCCT 5100
AGTGATGGAG TTGGCCACTC CCTCTCTGCG CGCTCGCTCG CTCACTGAGG 5150
CCGGGCGACC AAAGGTCGCC CGACGCCCGG GCTTTGCCCG GGCGGCCTCA 5200
GTGAGCGAGC GAGCGCGCAG CAGATCTGGA AGGTGCTGAG GTACGATGAG 5250
ACCCGCACCA GGTGCAGACC CTGCGAGTGT GGCGGTAAAC ATATTAGGAA 5300
CCAGCCTGTG ATGCTGGATG TGACCGAGGA GCTGAGGCCC GATCACTTGG 5350
TGCTGGCCTG CACCCGCGCT GAGTTTGGCT CTAGCGATGA AGATACAGAT 5400
TGAGGTACTG AAATGTGTGG GCGTGGCTTA AGGGTGGGAA AGAATATATA 5450
AGGTGGGGGT CTTATGTAGT TTTGTATCTG TTTTGCAGCA GCCGCCGCCG 5500
CCATGAGCAC CAACTCGTTT GATGGAAGCA TTGTGAGCTC ATATTTGACA 5550
ACGCGCATGC CCCCATGGGC CGGGGTGCGT CAGAATGTGA TGGGCTCCAG 5600
CATTGATGGT CGCCCCGTCC TGCCCGCAAA CTCTACTACC TTGACCTACG 5650
AGACCGTGTC TGGAACGCCG TTGGAGACTG CAGCCTCCGC CGCCGCTTCA 5700
GCCGCTGCAG CCACCGCCCG CGGGATTGTG ACTGACTTTG CTTTCCTGAG 5750
CCCGCTTGCA AGCAGTGCAG CTTCCCGTTC ATCCGCCCGC GATGACAAGT 5800
TGACGGCTCT TTTGGCACAA TTGGATTCTT TGACCCGGGA ACTTAATGTC 5850
GTTTCTCAGC AGCTGTTGGA TCTGCGCCAG CAGGTTTCTG CCCTGAAGGC 5900
TTCCTCCCCT CCCAATGCGG TTTAAAACAT AAATAAAAAA CCAGACTCTG 5950
TTTGGATTTG GATCAAGCAA GTGTCTTGCT GTCTTTATTT AGGGGTTTTG 6000
CGCGCGCGGT AGGCCCGGGA CCAGCGGTCT CGGTCGTTGA GGGTCCTGTG 6050
TATTTTTTCC AGGACGTGGT AAAGGTGACT CTGGATGTTC AGATACATGG 6100
GCATAAGCCC GTCTCTGGGG TGGAGGTAGC ACCACTGCAG AGCTTCATGC 6150
TGCGGGGTGG TGTTGTAGAT GATCCAGTCG TAGCAGGAGC GCTGGGCGTG 6200
GTGCCTAAAA ATGTCTTTCA GTAGCAAGCT GATTGCCAGG GGCAGGCCCT 6250
TGGTGTAAGT GTTTACAAAG CGGTTAAGCT GGGATGGGTG CATACGTGGG 6300
GATATGAGAT GCATCTTGGA CTGTATTTTT AGGTTGGCTA TGTTCCCAGC 6350
CATATCCCTC CGGGGATTCA TGTTGTGCAG AACCACCAGC ACAGTGTATC 6400
CGGTGCACTT GGGAAATTTG TCATGTAGCT TAGAAGGAAA TGCGTGGAAG 6450
AACTTGGAGA CGCCCTTGTG ACCTCCAAGA TTTTCCATGC ATTCGTCCAT 6500
AATGATGGCA ATGGGCCCAC GGGCGGCGGC CTGGGCGAAG ATATTTCTGG 6550
GATCACTAAC GTCATAGTTG TGTTCCAGGA TGAGATCGTC ATAGGCCATT 6600
TTTACAAAGC GCGGGCGGAG GGTGCCAGAC TGCGGTATAA TGGTTCCATC 6650
CGGCCCAGGG GCGTAGTTAC CCTCACAGAT TTGCATTTCC CACGCTTTGA 6700
GTTCAGATGG GGGGATCATG TCTACCTGCG GGGCGATGAA GAAAACGGTT 6750
TCCGGGGTAG GGGAGATCAG CTGGGAAGAA AGCAGGTTCC TGAGCAGCTG 6800
CGACTTACCG CAGCCGGTGG GCCCGTAAAT CACACCTATT ACCGGGTGCA 6850
ACTGGTAGTT AAGAGAGCTG CAGCTGCCGT GATCCCTGAG CAGGGGGGCC 6900
ACTTCGTTAA GCATGTCCCT GACTCGCATG TTTTCCCTGA CCAAATCCGC 6950
CAGAAGGCGC TCGCCGCCCA GCGATAGCAG TTCTTGCAAG GAAGCAAAGT 7000
TTTTCAACGG TTTGAGACCG TCCGCCGTAG GCATGCTTTT GAGCGTTTGA 7050
CCAAGCAGTT CCAGGCGGTC CCACAGCTCG GTCACCTGCT CTACGGCATC 7100
TCGATCCAGC ATATCTCCTC GTTTCGCGGG TTGGGGCGGC TTTCGCTGTA 7150
CGGCAGTAGT CGGTGCTCGT CCAGACGGGC CAGGGTCATG TCTTTCCACG 7200
GGCGCAGGGT CCTCGTCAGC GTAGTCTGGG TCACGGTGAA GGGGTGCGCT 7250
CCGGGCTGCG CGCTGGCCAG GGTGCGCTTG AGGCTGGTCC TGCTGGTGCT 7300
GAAGCGCTGC CGGTCTTCGC CCTGCGCGTC GGCCAGGTAG CATTTGACCA 7350
TGGTGTCATA GTCCAGCCCC TCCGCGGCGT GGCCCTTGGC GCGCAGCTTG 7400
CCCTTGGAGG AGGCGCCGCA CGAGGGGCAG TGCAGACTTT TGAGGGCGTA 7450
GAGCTTGGGC GCGAGAAATA CCGATTCCGG GGAGTAGGCA TCCGCGCCGC 7500
AGGCCCCGCA GACGGTCTCG CATTCCACGA GCCAGGTGAG CTCTGGCCGT 7550
TCGGGGTCAA AAACCAGGTT TCCCCCATGC TTTTTGATGC GTTTCTTACC 7600
TCTGGTTTCC ATGAGCCGGT GTCCACGCTC GGTGACGAAA AGGCTGTCCG 7650
TGTCCCCGTA TACAGACTTG AGAGGCCTGT CCTCGACCGA TGCCCTTGAG 7700
AGCCTTCAAC CCAGTCAGCT CCTTCCGGTG GGCGCGGGGC ATGACTATCG 7750
TCGCCGCACT TATGACTGTC TTCTTTATCA TGCAACTCGT AGGACAGGTG 7800
CCGGCAGCGC TCTGGGTCAT TTTCGGCGAG GACCGCTTTC GCTGGAGCGC 7850
GACGATGATC GGCCTGTCGC TTGCGGTATT CGGAATCTTG CACGCCCTCG 7900
CTCAAGCCTT CGTCACTGGT CCCGCCACCA AACGTTTCGG CGAGAAGCAG 7950
GCCATTATCG CCGGCATGGC GGCCGACGCG CTGGGCTACG TCTTGCTGGC 8000
GTTCGCGACG CGAGGCTGGA TGGCCTTCCC CATTATGATT CTTCTCGCTT 8050
CCGGCGGCAT CGGGATGCCC GCGTTGCAGG CCATGCTGTC CAGGCAGGTA 8100
GATGACGACC ATCAGGGACA GCTTCAAGGA TCGCTCGCGG CTCTTACCAG 8150
CCTAACTTCG ATCACTGGAC CGCTGATCGT CACGGCGATT TATGCCGCCT 8200
CGGCGAGCAC ATGGAACGGG TTGGCATGGA TTGTAGGCGC CGCCCTATAC 8250
CTTGTCTGCC TCCCCGCGTT GCGTCGCGGT GCATGGAGCC GGGCCACCTC 8300
GACCTGAATG GAAGCCGGCG GCACCTCGCT AACGGATTCA CCACTCCAAG 8350
AATTGGAGCC AATCAATTCT TGCGGAGAAC TGTGAATGCG CAAACCAACC 8400
CTTGGCAGAA CATATCCATC GCGTCCGCCA TCTCCAGCAG CCGCACGCGG 8450
CGCATCTCGG GCAGCGTTGG GTCCTGGCCA CGGGTGCGCA TGATCGTGCT 8500
CCTGTCGTTG AGGACCCGGC TAGGCTGGCG GGGTTGCCTT ACTGGTTAGC 8550
AGAATGAATC ACCGATACGC GAGCGAACGT GAAGCGACTG CTGCTGCAAA 8600
ACGTCTGCGA CCTGAGCAAC AACATGAATG GTCTTCGGTT TCCGTGTTTC 8650
GTAAAGTCTG GAAACGCGGA AGTCAGCGCC CTGCACCATT ATGTTCCGGA 8700
TCTGCATCGC AGGATGCTGC TGGCTACCCT GTGGAACACC TACATCTGTA 8750
TTAACGAAGC CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT 8800
AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC 8850
GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG 8900
ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 8950
CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC 9000
GGCTACACTA GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT 9050
TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG 9100
CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA 9150
AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA 9200
GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA 9250
GGATCTTCAC CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC 9300
TAAAGTATAT ATGAGTAAAC TTGGTCTGAC AGTTACCAAT GCTTAATCAG 9350
TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT 9400
GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC 9450
CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT 9500
ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG 9550
CAACTTTATC CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA 9600
GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC AACGTTGTTG CCATTGCTGC 9650
AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA TTCAGCTCCG 9700
GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA 9750
GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 9800
AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA 9850
TGCCATCCGT AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA 9900
TTCTGAGAAT AGTGTATGCG GCGACCGAGT TGCTCTTGCC CGGCGTCAAC 9950
ACGGGATAAT ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG CTCATCATTG 10000
GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC GCTGTTGAGA 10050
TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT 10100
TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG 10150
CAAAAAAGGG AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC 10200
CTTTTTCAAT ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG 10250
ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCGCGCA 10300
CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT TATTATCATG 10350
ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTTTC GTCTTCAA 10398


(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4910 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:






TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 50
GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 100
TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 150
CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGCG GTGTGAAATA 200
CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGGCGCC ATTCGCCATT 250
CAGGCTGCGC AACTGTTGGG AAGGGCGATC GGTGCGGGCC TCTTCGCTAT 300
TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGATT AAGTTGGGTA 350
ACGCCAGGGT TTTCCCAGTC ACGACGTTGT AAAACGACGG CCAGTGCCAA 400
GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA ACCTCCCACA 450
CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT 500
TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT 550
TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA 600
ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC GCCAAATCAT 650
TTATTGTTCA AAGATGCAGT CATCCAAATC CACATTGACC AGATCGCAGG 700
CAGTGCAAGC GTCTGGCACC TTTCCCATGA TATGATGAAT GTAGCACAGT 750
TTCTGATACG CCTTTTTGAC GACAGAAACG GGTTGAGATT CTGACACGGG 800
AAAGCACTCT AAACAGTCTT TCTGTCCGTG AGTGAAGCAG ATATTTGAAT 850
TCTGATTCAT TCTCTCGCAT TGTCTGCAGG GAAACAGCAT CAGATTCATG 900
CCCACGTGAC GAGAACATTT GTTTTGGTAC CTGTCTGCGT AGTTGATCGA 950
AGCTTCCGCG TCTGACGTCG ATGGCTGCGC AACTGACTCG CGCACCCGTT 1000
TGGGCTCACT TATATCTGCG TCACTGGGGG CGGGTCTTTT CTTGGCTCCA 1050
CCCTTTTTGA CGTAGAATTC ATGCTCCACC TCAACCACGT GATCCTTTGC 1100
CCACCGGAAA AAGTCTTTGA CTTCCTGCTT GGTGACCTTC CCAAAGTCAT 1150
GATCCAGACG GCGGGTGAGT TCAAATTTGA ACATCCGGTC TTGCAACGGC 1200
TGCTGGTGTT CGAAGGTCGT TGAGTTCCCG TCAATCACGG CGCACATGTT 1250
GGTGTTGGAG GTGACGATCA CGGGAGTCGG GTCTATCTGG GCCGAGGACT 1300
TGCATTTCTG GTCCACGCGC ACCTTGCTTC CTCCGAGAAT GGCTTTGGCC 1350
GACTCCACGA CCTTGGCGGT CATCTTCCCC TCCTCCCACC AGATCACCAT 1400
CTTGTCGACA CAGTCGTTGA AGGGAAAGTT CTCATTGGTC CAGTTTACGC 1450
ACCCGTAGAA GGGCACAGTG TGGGCTATGG CCTCCGCGAT GTTGGTCTTC 1500
CCGGTAGTTG CAGGCCCAAA CAGCCAGATG GTGTTCCTCT TGCCGAACTT 1550
TTTCGTGGCC CATCCCAGAA AGACGGAAGC CGCATATTGG GGATCGTACC 1600
CGTTTAGTTC CAAAATTTTA TAAATCCGAT TGCTGGAAAT GTCCTCCACG 1650
GGCTGCTGGC CCACCAGGTA GTCGGGGGCG GTTTTAGTCA GGCTCATAAT 1700
CTTTCCCGCA TTGTCCAAGG CAGCCTTGAT TTGGGACCGC GAGTTGGAGG 1750
CCGCATTGAA GGAGATGTAT GAGGCCTGGT CCTCCTGGAT CCACTGCTTC 1800
TCCGAGGTAA TCCCCTTGTC CACGAGCCAC CCGACCAGCT CCATGTACCT 1850
GGCTGAAGTT TTTGATCTGA TCACCGGCGC ATCAGAATTG GGATTCTGAT 1900
TCTCTTTGTT CTGCTCCTGC GTCTGCGACA CGTGCGTCAG ATGCTGCGCC 1950
ACCAACCGTT TACGCTCCGT GAGATTCAAA CAGGCGCTTA AATACTGTTC 2000
CATATTAGTC CACGCCCACT GGAGCTCAGG CTGGGTTTTG GGGAGCAAGT 2050
AATTGGGGAT GTAGCACTCA TCCACCACCT TGTTCCCGCC TCCGGCGCCA 2100
TTTCTGGTCT TTGTGACCGC GAACCAGTTT GGCAAAGTCG GCTCGATCCC 2150
GCGGTAAATT CTCTGAATCA GTTTTTCGCG AATCTGACTC AGGAAACGTC 2200
CCAAAACCAT GGATTTCACC CCGGTGGTTT CCACGAGCAC GTGCATGTGG 2250
AAGTAGCTCT CTCCCTTCTC AAATTGCACA AAGAAAAGGG CCTCCGGGGC 2300
CTTACTCACA CGGCGCCATT CCGTCAGAAA GTCGCGCTGC AGCTTCTCGG 2350
CCACGGTCAG GGGTGCCTGC TCAATCAGAT TCAGATCCAT GTCAGAATCT 2400
GGCGGCAACT CCCATTCCTT CTCGGCCACC CAGTTCACAA AGCTGTCAGA 2450
AATGCCGGGC AGATGCCCGT CAAGGTCGCT GGGGACCTTA ATCACAATCT 2500
CGTAAAACCC CGGCATGGCG GCTGCGCGTT CAAACCTCCC GCTTCAAAAT 2550
GGAGACCCTG CGTGCTCACT CGGGCTTAAA TACCCAGCGT GACCACATGG 2600
TGTCGCAAAA TGTCGCAAAA CACTCACGTG ACCTCTAATA CAGGACTCTA 2650
GAGGATCCCC GGGTACCGAG CTCGAATTCG TAATCATGGT CATAGCTGTT 2700
TCCTGTGTGA AATTGTTATC CGCTCACAAT TCCAGACAAC ATACGAGCCG 2750
GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTGACA 2800
TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 2850
CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 2900
TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT 2950
CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC 3000
CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC 3050
AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG 3100
CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG 3150
GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 3200
CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC 3250
GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG 3300
GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG 3350
AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 3400
GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG 3450
TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTGTTGA 3500
AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC 3550
GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC 3600
CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC 3650
AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT 3700
ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT 3750
CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 3800
GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT 3850
TACCAATGCT TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG 3900
TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG 3950
AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC 4000
TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA 4050
GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 4100
GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC 4150
GTTGTTGCCA TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT 4200
GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC 4250
CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 4300
AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA 4350
TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 4400
AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC 4450
TCTTGCCCGG CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT 4500
AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA 4550
TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 4600
TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC 4650
AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 4700
GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT 4750
TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA 4800
AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GACGTCTAAG 4850
AAACCATTAT TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG 4900
CCCTTTCGTC 4910
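The sequence listings in this document use the customary patent layout: blocks of ten bases followed by a running base count at the end of each row. A minimal sketch of how such rows could be checked mechanically is given below. The function name `check_rows` and the row format it expects are assumptions made for illustration only; the two sample rows are the first 100 bases of SEQ ID NO: 2.

```python
VALID_BASES = set("ACGT")


def check_rows(rows, start=0):
    """Check rows of the form 'BLOCK BLOCK ... COUNT'.

    COUNT is taken to be the cumulative number of bases up to the end
    of the row. Returns a list of (row_number, problem) tuples.
    """
    problems = []
    total = start
    for number, row in enumerate(rows, 1):
        *blocks, count = row.split()
        for block in blocks:
            bad = set(block.upper()) - VALID_BASES
            if bad:
                problems.append(
                    (number, f"non-ACGT characters {sorted(bad)} in {block}")
                )
        total += sum(len(block) for block in blocks)
        if int(count) != total:
            problems.append(
                (number, f"count {count} does not match running total {total}")
            )
            total = int(count)  # resynchronise on the printed count
    return problems


rows = [
    "TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 50",
    "GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 100",
]
print(check_rows(rows))
```

A row containing a character outside A, C, G and T, or a row whose printed count disagrees with the number of bases seen so far, is reported rather than silently accepted, which makes scanning damage in a listing easy to locate.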
WHAT IS CLAIMED IS:

1. A recombinant hybrid virus comprising:

(a) DNA sequences of, or corresponding
to, the 5' inverted terminal repeat (ITR) sequences of an
adenovirus and the 5' adenovirus packaging/enhancer
domain;

(b) DNA sequences of, or corresponding
to, the 5' adeno-associated virus (AAV) ITR sequences;

(c) a gene encoding a selected protein
operatively linked to regulatory sequences directing
expression of the protein in a target cell in vivo or in
vitro;

(d) DNA sequences of, or corresponding
to, the 3' AAV ITR sequences;

(e) DNA sequences of, or corresponding
to, the 3' adenovirus ITR sequences;

wherein said virus is replication-defective
and is provided with a sufficient portion of
the genome of the adenovirus to permit infection of the
target cell.

2. The virus according to claim 1 wherein
said adenovirus is rendered replication defective by a
deletion in all or a part of the E1 gene.

3. The virus according to claim 2 wherein
said adenovirus genome has a deletion in all or a part of
the E3 gene.

4. The virus according to claim 1 wherein
said adenovirus genome comprises deletions in the DNA
sequences of all or a portion of the adenovirus genes
selected from the group consisting of the E2a gene, the
E4 gene, the late genes L1 through L5, the intermediate
genes IX and IVa, and a combination thereof.

5. The virus according to claim 1 wherein
said selected gene is a reporter gene.

6. The virus according to claim 5 wherein
said reporter gene is selected from the group consisting
of the genes encoding β-galactosidase, alkaline
phosphatase and green fluorescent protein.

7. The virus according to claim 1 wherein
said selected gene is a therapeutic gene.

8. The virus according to claim 7 wherein
said therapeutic gene is selected from the group
consisting of a normal CFTR gene and a normal LDL gene.

9. The virus according to claim 1 further
comprising the DNA of, or corresponding to, a functional
portion of the genome of an adeno-associated virus rep
gene.

10. A recombinant hybrid vector comprising:

(a) DNA sequences of, or corresponding
to, the 5' inverted terminal repeat (ITR) sequences of an
adenovirus and the 5' adenovirus packaging/enhancer
domain;

(b) DNA sequences of, or corresponding
to, the 5' adeno-associated virus (AAV) ITR sequences;

(c) a gene encoding a selected protein
operatively linked to regulatory sequences directing
expression of the protein in a target cell in vivo or in
vitro;

(d) DNA sequences of, or corresponding
to, the 3' AAV ITR sequences;

(e) DNA sequences of, or corresponding
to, the 3' adenovirus ITR sequences; and

(f) plasmid DNA sequences containing
regulatory elements.

11. A recombinant trans-infection particle
comprising:

(a) a recombinant hybrid virus comprising:

(i) DNA sequences of, or corresponding
to, the 5' inverted terminal repeat (ITR) sequences of an
adenovirus and the 5' adenovirus packaging/enhancer
domain;

(ii) DNA sequences of, or corresponding
to, the 5' adeno-associated virus (AAV) ITR sequences;

(iii) a gene encoding a selected protein
operatively linked to regulatory sequences directing
expression of the protein in a target cell in vivo or in
vitro;

(iv) DNA sequences of, or corresponding
to, the 3' AAV ITR sequences;

(v) DNA sequences of, or corresponding
to, the 3' adenovirus ITR sequences;

wherein said virus is replication-defective
and is provided with a sufficient portion of
the genome of the adenovirus to permit infection of the
target cell;

(b) a polycation sequence conjugated to said
hybrid virus; and

(c) a plasmid comprising an AAV rep gene
operatively linked to regulatory sequences capable of
directing its expression, said plasmid associated with
said polycation sequence.

12. The trans-infection particle according to
claim 11 wherein said adenovirus DNA lacks the sequence
encoding viral genes.

13. The trans-infection particle according to
claim 11 wherein said adenovirus genome is rendered
replication-defective by a deletion in all or a part of
the E1 gene.

14. The particle according to claim 13 wherein
said adenovirus genome has a deletion in all or a part of
the E3 gene.

15. The particle according to claim 11 wherein
said adenovirus genome has deletions in the DNA sequences
of all or a portion of the adenovirus genes selected from
the group consisting of the E2a gene, the E4 gene, the
late genes L1 through L5, the intermediate genes IX and
IVa, and a combination thereof.

16. The particle according to claim 11 wherein
said selected gene is a reporter gene.

17. The particle according to claim 16 wherein
said reporter gene is selected from the group consisting
of the genes encoding β-galactosidase, alkaline
phosphatase and green fluorescent protein.

18. The particle according to claim 11 wherein
said selected gene is a therapeutic gene.

19. The particle according to claim 18 wherein
said therapeutic gene is selected from the group
consisting of a normal CFTR gene and a normal LDL gene.

20. A composition for use in delivering and
stably integrating a selected gene into the chromosome of
a target cell, said composition comprising

(a) a recombinant hybrid virus
comprising:

(i) DNA sequences of, or
corresponding to, the 5' inverted terminal repeat (ITR)
sequences of an adenovirus and the 5' adenovirus
packaging/enhancer domain;

(ii) DNA sequences of, or
corresponding to, the 5' adeno-associated virus (AAV) ITR
sequences;

(iii) a gene encoding a selected
protein operatively linked to regulatory sequences
directing expression of the protein in a target cell in
vivo or in vitro;

(iv) DNA sequences of, or
corresponding to, the 3' AAV ITR sequences;

(v) DNA sequences of, or
corresponding to, the 3' adenovirus ITR sequences;

wherein said virus is replication-defective
and is provided with a sufficient portion of
the genome of the adenovirus to permit infection of the
target cell; and

(b) a pharmaceutically acceptable
carrier.

21. The composition according to claim 20
further comprising a plasmid comprising an AAV rep gene
under the control of regulatory sequences capable of
expressing said rep gene in said target cell.

22. The composition according to claim 21
wherein said vector further comprises the DNA of, or
corresponding to, at least a functional portion of the
genome of an adeno-associated virus rep gene.

23. A composition for use in delivering and
stably integrating a selected gene into the chromosome of
a target cell comprising an effective amount of a
recombinant trans-infection particle in a
pharmaceutically acceptable carrier, said particle
comprising:

(a) a recombinant hybrid virus comprising:

(i) DNA sequences of, or corresponding
to, the 5' inverted terminal repeat (ITR) sequences of an
adenovirus and the 5' adenovirus packaging/enhancer
domain;

(ii) DNA sequences of, or corresponding
to, the 5' adeno-associated virus (AAV) ITR sequences;

(iii) a gene encoding a selected protein
operatively linked to regulatory sequences directing
expression of the protein in a target cell in vivo or in
vitro;

(iv) DNA sequences of, or corresponding
to, the 3' AAV ITR sequences;

(v) DNA sequences of, or corresponding
to, the 3' adenovirus ITR sequences;

wherein said virus is replication-defective
and is provided with a sufficient portion of
the genome of the adenovirus to permit infection of the
target cell;

(b) a polycation sequence conjugated to said
hybrid virus;

(c) a plasmid comprising an AAV rep gene
operatively linked to regulatory sequences capable of
directing its expression, said plasmid associated with
said polycation sequence.

24. A mammalian cell capable of expressing a
selected gene introduced therein through transduction of
the virus of claim 1, the vector of claim 10, or the
trans-infection particle of claim 11.

25. A method for producing high levels of a
recombinant adeno-associated virus comprising the steps
of

(a) culturing a cell co-transfected with
the vector of claim 10 and an optional helper virus in
the presence of a plasmid containing an AAV rep gene
under the control of regulatory sequences capable of
expressing said rep gene; and

(b) isolating from said culture a
recombinant AAV.

26. A method for producing high levels of a
recombinant adeno-associated virus comprising the steps
of

(a) culturing a cell transfected with the
particle of claim 11 and

(b) isolating from said culture a
recombinant AAV.
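
The element layout recited in claim 1 can be pictured as an ordered list. The sketch below is illustrative only: it assumes that elements (a) through (e) are arranged 5' to 3' in the order in which the claim recites them, and the names `Element`, `CLAIM_1_ORDER`, `follows_claim_order` and `transgene_is_flanked_by_aav_itrs` are hypothetical helpers, not part of the disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    label: str  # sub-paragraph of claim 1, e.g. "a"
    name: str   # short description paraphrasing the claim wording


# Elements (a) through (e) in the order recited in claim 1
# (assumed here to be the 5'-to-3' order on the vector).
CLAIM_1_ORDER = (
    Element("a", "5' adenovirus ITR and packaging/enhancer domain"),
    Element("b", "5' AAV ITR"),
    Element("c", "selected gene with regulatory sequences"),
    Element("d", "3' AAV ITR"),
    Element("e", "3' adenovirus ITR"),
)


def follows_claim_order(labels):
    """True if `labels` lists every claim 1 element once, in recited order."""
    return list(labels) == [element.label for element in CLAIM_1_ORDER]


def transgene_is_flanked_by_aav_itrs(labels):
    """True if the gene (c) sits directly between AAV ITRs (b) and (d)."""
    labels = list(labels)
    if "c" not in labels:
        return False
    i = labels.index("c")
    return 0 < i < len(labels) - 1 and labels[i - 1] == "b" and labels[i + 1] == "d"


print(follows_claim_order("abcde"))
print(transgene_is_flanked_by_aav_itrs("abcde"))
```

The second helper isolates the feature that the AAV ITRs bracket the selected gene, which is the arrangement the nested ITR pairs in elements (b) through (d) describe.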



FIG. 1A

[Plasmid map of pAd.AV.CMVLacZ (10,398 bp), marking the ITR, LacZ and Ad 9-16 m.u. regions and the restriction sites EcoRI(1), NheI(7), BglII(365), SnaBI(841), XbaI(1161), BamHI(1167), XhoI(1177), BamHI(1245), NotI(1355), ClaI(2316), NotI(4829), BamHI(4839), BamHI(5037), XbaI(5043) and BglII(5222). The plasmid is digested with NheI.]

FIG. 1B

[The NheI-digested plasmid, drawn in linear form with the same restriction sites and ending at EcoRI(10399), and ClaI-digested Ad5 (0 to 100 m.u.) are co-transfected into 293 cells, followed by intracellular homologous recombination. The product is the hybrid Ad.AV.CMVLacZ, in which the AV.CMVLacZ segment lies between Ad5 sequences.]
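
The site coordinates printed on the map of FIG. 1 can be compared with the nucleotide sequence of FIG. 2. The sketch below is a minimal illustration, not a restriction-mapping tool: it assumes each coordinate refers to the first base of the recognition sequence, uses the standard recognition sequences for EcoRI, NheI and BglII, and takes its two input rows from FIG. 2A (bases 1-60 and 361-420). The function name `find_sites` is hypothetical.

```python
# Standard recognition sequences for three of the enzymes named in FIG. 1.
SITES = {
    "EcoRI": "GAATTC",
    "NheI": "GCTAGC",
    "BglII": "AGATCT",
}


def find_sites(seq, offset=1):
    """Return sorted (position, enzyme) pairs.

    `position` is the 1-based coordinate of the first base of the
    recognition sequence; `offset` is the coordinate of seq[0].
    """
    seq = seq.replace(" ", "").upper()
    hits = []
    for enzyme, site in SITES.items():
        start = seq.find(site)
        while start != -1:
            hits.append((start + offset, enzyme))
            start = seq.find(site, start + 1)
    return sorted(hits)


row_1_60 = "GAATTCGCTA GCATCATCAA TAATATACCT TATTTTGGAT TGAAGCCAAT ATGATAATGA"
row_361_420 = "AGGGAGATCT GCTGCGCGCT CGCTCGCTCA CTGAGGCCGC CCGGGCAAAG CCCGGGCGTC"

print(find_sites(row_1_60))                 # [(1, 'EcoRI'), (7, 'NheI')]
print(find_sites(row_361_420, offset=361))  # [(365, 'BglII')]
```

Under these assumptions the computed positions agree with the EcoRI(1), NheI(7) and BglII(365) labels on the plasmid map.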




FIGURE 2A

GAATTCGCTA GCATCATCAA TAATATACCT TATTTTGGAT TGAAGCCAAT ATGATAATGA 60
GGGGGTGGAG TTTGTGACGT GGCGCGGGGC GTGGGAACGG GGCGGGTGAC GTAGTAGTGT 120
GGCGGAAGTG TGATGTTGCA AGTGTGGCGG AACACATGTA AGCGACGGAT GTGGCAAAAG 180
TGACGTTTTT GGTGTGCGCC GGTGTACACA GGAAGTGACA ATTTTCGCGC GGTTTTAGGC 240
GGATGTTGTA GTAAATTTGG GCGTAACCGA GTAAGATTTG GCCATTTTCG CGGGAAAACT 300
GAATAAGAGG AAGTGAAATC TGAATAATTT TGTGTTACTC ATAGCGCGTA ATATTTGTCT 360
AGGGAGATCT GCTGCGCGCT CGCTCGCTCA CTGAGGCCGC CCGGGCAAAG CCCGGGCGTC 420
GGGCGACCTT TGGTCGCCCG GCCTCAGTGA GCGAGCGAGC GCGCAGAGAG GGAGTGGCCA 480
ACTCCATCAC TAGGGGTTCC TTGTAGTTAA TGATTAACCC GCCATGCTAC TTATCTACAA 540
TTCGAGCTTG CATGCCTGCA GGTCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 600
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 660
ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 720
GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 780
CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 840
TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 900
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 960
TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 1020



FIGURE 2B

ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TCGTTTAGTG 1080
AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG AAGACACCGG 1140
GACCGATCCA GCCTCCGGAC TCTAGAGGAT CCGGTACTCG AGGAACTGAA AAACCAGAAA 1200
GTTAACTGGT AAGTTTAGTC TTTTTGTCTT TTATTTCAGG TCCCGGATCC GGTGGTGGTG 1260
CAAATCAAAG AACTGCTCCT CAGTGGATGT TGCCTTTACT TCTAGGCCTG TACGGAAGTG 1320
TTACTTCTGC TCTAAAAGCT GCGGAATTGT ACCCGCGGCC GCAATTCCCG GGGATCGAAA 1380
GAGCCTGCTA AAGCAAAAAA GAAGTCACCA TGTCGTTTAC TTTGACCAAC AAGAACGTGA 1440
TTTTCGTTGC CGGTCTGGGA GGCATTGGTC TGGACACCAG CAAGGAGCTG CTCAAGCGCG 1500
ATCCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA CTTAATCGCC 1560
TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC ACCGATCGCG 1620
CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCTT TGCCTGGTTT CCGGCACCAG 1680
AAGCGGTGCC GGAAAGCTGG CTGGAGTGCG ATCTTCCTGA GGCCGATACT GTCGTCGTCC 1740
CCTCAAACTG GCAGATGCAC GGTTACGATG CGCCCATCTA CACCAACGTA ACCTATCCCA 1800
TTACGGTCAA TCCGCCGTTT GTTCCCACGG AGAATCCGAC GGGTTGTTAC TCGCTCACAT 1860
TTAATGTTGA TGAAAGCTGG CTACAGGAAG GCCAGACGCG AATTATTTTT GATGGCGTTA 1920
ACTCGGCGTT TCATCTGTGG TGCAACGGGC GCTGGGTCGG TTACGGCCAG GACAGTCGTT 1980
TGCCGTCTGA ATTTGACCTG AGCGCATTTT TACGCGCCGG AGAAAACCGC CTCGCGGTGA 2040



FIGURE 2C

TGGTGCTGCG TTGGAGTGAC GGCAGTTATC TGGAAGATCA GGATATGTGG CGGATGAGCG 2100
GCATTTTCCG TGACGTCTCG TTGCTGCATA AACCGACTAC ACAAATCAGC GATTTCCATG 2160
TTGCCACTCG CTTTAATGAT GATTTCAGCC GCGCTGTACT GGAGGCTGAA GTTCAGATGT 2220
GCGGCGAGTT GCGTGACTAC CTACGGGTAA CAGTTTCTTT ATGGCAGGGT GAAACGCAGG 2280
TCGCCAGCGG CACCGCGCCT TTCGGCGGTG AAATTATCGA TGAGCGTGGT GGTTATGCCG 2340
ATCGCGTCAC ACTACGTCTG AACGTCGAAA ACCCGAAACT GTGGAGCGCC GAAATCCCGA 2400
ATCTCTATCG TGCGGTGGTT GAACTGCACA CCGCCGACGG CACGCTGATT GAAGCAGAAG 2460
CCTGCGATGT CGGTTTCCGC GAGGTGCGGA TTGAAAATGG TCTGCTGCTG CTGAACGGCA 2520
AGCCGTTGCT GATTCGAGGC GTTAACCGTC ACGAGCATCA TCCTCTGCAT GGTCAGGTCA 2580
TGGATGAGCA GACGATGGTG CAGGATATCC TGCTGATGAA GCAGAACAAC TTTAACGCCG 2640
TGCGCTGTTC GCATTATCCG AACCATCCGC TGTGGTACAC GCTGTGCGAC CGCTACGGCC 2700
TGTATGTGGT GGATGAAGCC AATATTGAAA CCCACGGCAT GGTGCCAATG AATCGTCTGA 2760
CCGATGATCC GCGCTGGCTA CCGGCGATGA GCGAACGCGT AACGCGAATG GTGCAGCGCG 2820
ATCGTAATCA CCCGAGTGTG ATCATCTGGT CGCTGGGGAA TGAATCAGGC CACGGCGCTA 2880
ATCACGACGC GCTGTATCGC TGGATCAAAT CTGTCGATCC TTCCCGCCCG GTGCAGTATG 2940
AAGGCGGCGG AGCCGACACC ACGGCCACCG ATATTATTTG CCCGATGTAC GCGCGCGTGG 3000
ATGAAGACCA GCCCTTCCCG GCTGTGCCGA AATGGTCCAT CAAAAAATGG CTTTCGCTAC 3060
FIGURE 2D

CTGGAGAGAC GCGCCCGCTG ATCCTTTGCG AATACGCCCA CGCGATGGGT AACAGTCTTG 3120
GCGGTTTCGC TAAATACTGG CAGGCGTTTC GTCAGTATCC CCGTTTACAG GGCGGCTTCG 3180
TCTGGGACTG GGTGGATCAG TCGCTGATTA AATATGATGA AAACGGCAAC CCGTGGTCGG 3240
CTTACGGCGG TGATTTTGGC GATACGCCGA ACGATCGCCA GTTCTGTATG AACGGTCTGG 3300
TCTTTGCCGA CCGCACGCCG CATCCAGCGC TGACGGAAGC AAAACACCAG CAGCAGTTTT 3360
TCCAGTTCCG TTTATCCGGG CAAACCATCG AAGTGACCAG CGAATACCTG TTCCGTCATA 3420
GCGATAACGA GCTCCTGCAC TGGATGGTGG CGCTGGATGG TAAGCCGCTG GCAAGCGGTG 3480
AAGTGCCTCT GGATGTCGCT CCACAAGGTA AACAGTTGAT TGAACTGCCT GAACTACCGC 3540
AGCCGGAGAG CGCCGGGCAA CTCTGGCTCA CAGTACGCGT AGTGCAACCG AACGCGACCG 3600
CATGGTCAGA AGCCGGGCAC ATCAGCGCCT GGCAGCAGTG GCGTCTGGCG GAAAACCTCA 3660
GTGTGACGCT CCCCGCCGCG TCCCACGCCA TCCCGCATCT GACCACCAGC GAAATGGATT 3720
TTTGCATCGA GCTGGGTAAT AAGCGTTGGC AATTTAACCG CCAGTCAGGC TTTCTTTCAC 3780
AGATGTGGAT TGGCGATAAA AAACAACTGC TGACGCCGCT GCGCGATCAG TTCACCCGTG 3840
CACCGCTGGA TAACGACATT GGCGTAAGTG AAGCGACCCG CATTGACCCT AACGCCTGGG 3900
TCGAACGCTG GAAGGCGGCG GGCCATTACC AGGCCGAAGC AGCGTTGTTG CAGTGCACGG 3960
CAGATACACT TGCTGATGCG GTGCTGATTA CGACCGCTCA CGCGTGGCAG CATCAGGGGA 4020
AAACCTTATT TATCAGCCGG AAAACCTACC GGATTGATGG TAGTGGTCAA ATGGCGATTA 4080
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CCGTTGATGT TGAAGTGGCG 
AGCTGGCGCA GGTAGCAGAG 
CCGACCGCCT TACTGCCGCC 
CCCCGTACGT CTTCCCGAGC 
GCCCACACCA GTGGCGCGGC 
TGATGGAAAC CAGCCATCGC 
ACGGTTTCCA TATGGGGATT 
TACAGCTGAG CGCCGGTCGC 
CGGGCAGGCC ATGTCTGCCC 
ACACAAACTT TTGGATGTTC 
ACTTCCCGTT TTTCCCGATT 
GTATTATTTT TGCCGCTATT 
TTTCTGACAA ACTCGGCCTC 
ATTGATGAGT TTGGACAAAC 
ATTTGTGATG CTATTGCTTT 
AACAATTGCA TTCATTTTAT 
CCTCTAGAGT CGAGTAGATA 
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FIGURE 2E 

AGCGATACAC CGCATCCGGC 

CGGGTAAACT GGCTCGGATT 

TGTTTTGACC GCTGGGATCT 

GAAAACGGTC TGCGCTGCGG 

GACTTCCAGT TCAACATCAG 

CATCTGCTGC ACGCGGAAGA 

GGTGGCGACG ACTCCTGGAG 

TACCATTACC AGTTGGTCTG 

GTATTTCGCG TAAGGAAATC 

GGTTTATTCT TTTTCTTTTA 

TGGCTACATG ACATCAACCA 

TCTCTGTTCT CGCTATTATT 

GACTCTAGGC GGCCGCGGGG 

CACAACTAGA ATGCAGTGAA 

ATTTGTAACC ATTATAAGCT 

GTTTCAGGTT CAGGGGGAGG 

AGTAGCATGG CGGGTTAATC 
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GCGGATTGGC CTGAACTGCC 

4140 

AGGGCCGCAA GAAAACTATC 

4200 

GCCATTGTCA GACATGTATA 

4260 

GACGCGCGAA TTGAATTATG 

4320 



CCGCTACAGT CAACAGCAAC 

4380 

AGGCACATGG CTGAATATCG 

4440 

CCCGTCAGTA TCGGCGGAAT 

4500 

GTGTCAAAAA TAATAATAAC 

4560 



CATTATGTAC TATTTAAAAA 

4620 

CTTTTTTATC ATGGGAGCCT 

4680 

TATCAGCAAA AGTGATACGG 

4740 

CCAACCGCTG TTTGGTCTGC 

4800 

ATCCAGACAT GATAAGATAC 

4860 

AAAAATGCTT TATTTGTGAA 

4920 

GCAATAAACA AGTTAACAAC 

4980 

TGTGGGAGGT TTTTTCGGAT 

5040 

ATTAACTACA AGGAACCCCT 

5100

FIGURE 2F

AGTGATGGAG TTGGCCACTC CCTCTCTGCG CGCTCGCTCG CTCACTGAGG CCGGGCGACC

5160

AAAGGTCGCC CGACGCCCGG GCTTTGCCCG GGCGGCCTCA GTGAGCGAGC GAGCGCGCAG

5220

CAGATCTGGA AGGTGCTGAG GTACGATGAG ACCCGCACCA GGTGCAGACC CTGCGAGTGT

5280

GGCGGTAAAC ATATTAGGAA CCAGCCTGTG ATGCTGGATG TGACCGAGGA GCTGAGGCCC

5340

GATCACTTGG TGCTGGCCTG CACCCGCGCT GAGTTTGGCT CTAGCGATGA AGATACAGAT

5400

TGAGGTACTG AAATGTGTGG GCGTGGCTTA AGGGTGGGAA AGAATATATA AGGTGGGGGT

5460

CTTATGTAGT TTTGTATCTG TTTTGCAGCA GCCGCCGCCG CCATGAGCAC CAACTCGTTT

5520

GATGGAAGCA TTGTGAGCTC ATATTTGACA ACGCGCATGC CCCCATGGGC CGGGGTGCGT

5580

CAGAATGTGA TGGGCTCCAG CATTGATGGT CGCCCCGTCC TGCCCGCAAA CTCTACTACC

5640

TTGACCTACG AGACCGTGTC TGGAACGCCG TTGGAGACTG CAGCCTCCGC CGCCGCTTCA

5700

GCCGCTGCAG CCACCGCCCG CGGGATTGTG ACTGACTTTG CTTTCCTGAG CCCGCTTGCA

5760

AGCAGTGCAG CTTCCCGTTC ATCCGCCCGC GATGACAAGT TGACGGCTCT TTTGGCACAA

5820

TTGGATTCTT TGACCCGGGA ACTTAATGTC GTTTCTCAGC AGCTGTTGGA TCTGCGCCAG

5880

CAGGTTTCTG CCCTGAAGGC TTCCTCCCCT CCCAATGCGG TTTAAAACAT AAATAAAAAA

5940

CCAGACTCTG TTTGGATTTG GATCAAGCAA GTGTCTTGCT GTCTTTATTT AGGGGTTTTG

6000

CGCGCGCGGT AGGCCCGGGA CCAGCGGTCT CGGTCGTTGA GGGTCCTGTG TATTTTTTCC

6060

AGGACGTGGT AAAGGTGACT CTGGATGTTC AGATACATGG GCATAAGCCC GTCTCTGGGG

6120

FIGURE 2G



TGGAGGTAGC ACCACTGCAG AGCTTCATGC TGCGGGGTGG TGTTGTAGAT GATCCAGTCG

6180 



TAGCAGGAGC GCTGGGCGTG GTGCCTAAAA ATGTCTTTCA GTAGCAAGCT GATTGCCAGG 

6240



GGCAGGCCCT TGGTGTAAGT GTTTACAAAG CGGTTAAGCT GGGATGGGTG CATACGTGGG 

6300 

GATATGAGAT GCATCTTGGA CTGTATTTTT AGGTTGGCTA TGTTCCCAGC CATATCCCTC 

6360 



CGGGGATTCA TGTTGTGCAG AACCACCAGC ACAGTGTATC CGGTGCACTT GGGAAATTTG 

6420 



TCATGTAGCT TAGAAGGAAA TGCGTGGAAG AACTTGGAGA CGCCCTTGTG ACCTCCAAGA 

6480 



TTTTCCATGC ATTCGTCCAT AATGATGGCA ATGGGCCCAC GGGCGGCGGC CTGGGCGAAG 

6540 

ATATTTCTGG GATCACTAAC GTCATAGTTG TGTTCCAGGA TGAGATCGTC ATAGGCCATT 

6600 



TTTACAAAGC GCGGGCGGAG GGTGCCAGAC TGCGGTATAA TGGTTCCATC CGGCCCAGGG 

6660 



GCGTAGTTAC CCTCACAGAT TTGCATTTCC CACGCTTTGA GTTCAGATGG GGGGATCATG 

6720 



TCTACCTGCG GGGCGATGAA GAAAACGGTT TCCGGGGTAG GGGAGATCAG CTGGGAAGAA 

6780 

AGCAGGTTCC TGAGCAGCTG CGACTTACCG CAGCCGGTGG GCCCGTAAAT CACACCTATT 

6840 

ACCGGGTGCA ACTGGTAGTT AAGAGAGCTG CAGCTGCCGT CATCCCTGAG CAGGGGGGCC 

6900 

ACTTCGTTAA GCATGTCCCT GACTCGCATG TTTTCCCTGA CCAAATCCGC CAGAAGGCGC 

6960 



TCGCCGCCCA GCGATAGCAG TTCTTGCAAG GAAGCAAAGT TTTTCAACGG TTTGAGACCG 

7020 

TCCGCCGTAG GCATGCTTTT GAGCGTTTGA CCAAGCAGTT CCAGGCGGTC CCACAGCTCG 

7080 

GTCACCTGCT CTACGGCATC TCGATCCAGC ATATCTCCTC GTTTCGCGGG TTGGGGCGGC 

7140

FIGURE 2H

TTTCGCTGTA CGGCAGTAGT CGGTGCTCGT CCAGACGGGC CAGGGTCATG TCTTTCCACG

7200

GGCGCAGGGT CCTCGTCAGC GTAGTCTGGG TCACGGTGAA GGGGTGCGCT CCGGGCTGCG

7260

CGCTGGCCAG GGTGCGCTTG AGGCTGGTCC TGCTGGTGCT GAAGCGCTGC CGGTCTTCGC

7320

CCTGCGCGTC GGCCAGGTAG CATTTGACCA TGGTGTCATA GTCCAGCCCC TCCGCGGCGT

7380

GGCCCTTGGC GCGCAGCTTG CCCTTGGAGG AGGCGCCGCA CGAGGGGCAG TGCAGACTTT

7440

TGAGGGCGTA GAGCTTGGGC GCGAGAAATA CCGATTCCGG GGAGTAGGCA TCCGCGCCGC

7500

AGGCCCCGCA GACGGTCTCG CATTCCACGA GCCAGGTGAG CTCTGGCCGT TCGGGGTCAA

7560

AAACCAGGTT TCCCCCATGC TTTTTGATGC GTTTCTTACC TCTGGTTTCC ATGAGCCGGT

7620

GTCCACGCTC GGTGACGAAA AGGCTGTCCG TGTCCCCGTA TACAGACTTG AGAGGCCTGT

7680

CCTCGACCGA TGCCCTTGAG AGCCTTCAAC CCAGTCAGCT CCTTCCGGTG GGCGCGGGGC

7740

ATGACTATCG TCGCCGCACT TATGACTGTC TTCTTTATCA TGCAACTCGT AGGACAGGTG

7800

CCGGCAGCGC TCTGGGTCAT TTTCGGCGAG GACCGCTTTC GCTGGAGCGC GACGATGATC

7860

GGCCTGTCGC TTGCGGTATT CGGAATCTTG CACGCCCTCG CTCAAGCCTT CGTCACTGGT

7920

CCCGCCACCA AACGTTTCGG CGAGAAGCAG GCCATTATCG CCGGCATGGC GGCCGACGCG

7980

CTGGGCTACG TCTTGCTGGC GTTCGCGACG CGAGGCTGGA TGGCCTTCCC CATTATGATT

8040

CTTCTCGCTT CCGGCGGCAT CGGGATGCCC GCGTTGCAGG CCATGCTGTC CAGGCAGGTA

8100

GATGACGACC ATCAGGGACA GCTTCAAGGA TCGCTCGCGG CTCTTACCAG CCTAACTTCG

8160
FIGURE 2I



ATCACTGGAC CGCTGATCGT CACGGCGATT TATGCCGCCT CGGCGAGCAC ATGGAACGGG 

8220 

TTGGCATGGA TTGTAGGCGC CGCCCTATAC CTTGTCTGCC TCCCCGCGTT GCGTCGCGGT 

8280 

GCATGGAGCC GGGCCACCTC GACCTGAATG GAAGCCGGCG GCACCTCGCT AACGGATTCA 

8340 



CCACTCCAAG AATTGGAGCC AATCAATTCT TGCGGAGAAC TGTGAATGCG CAAACCAACC 

8400 



CTTGGCAGAA CATATCCATC GCGTCCGCCA TCTCCAGCAG CCGCACGCGG CGCATCTCGG 

8460 

GCAGCGTTGG GTCCTGGCCA CGGGTGCGCA TGATCGTGCT CCTGTCGTTG AGGACCCGGC 

8520 

TAGGCTGGCG GGGTTGCCTT ACTGGTTAGC AGAATGAATC ACCGATACGC GAGCGAACGT 

8580 



GAAGCGACTG CTGCTGCAAA ACGTCTGCGA CCTGAGCAAC AACATGAATG GTCTTCGGTT 

8640 

TCCGTGTTTC GTAAAGTCTG GAAACGCGGA AGTCAGCGCC CTGCACCATT ATGTTCCGGA 

8700 

TCTGCATCGC AGGATGCTGC TGGCTACCCT GTGGAACACC TACATCTGTA TTAACGAAGC 

8760 



CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG 

8820 

GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT 

8880 

CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG 

8940 

ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC 

9000 

GGCTACACTA GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA 

9060 

AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT 

9120 

GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT 

9180
FIGURE 2J



TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA

9240 

TTATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC 

9300 



TAAAGTATAT ATGAGTAAAC TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT 

9360 



ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA 

9420 

ACTACGATAC GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA 

9480 



CGCTCACCGG CTCCAGATTT ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA 

9540 



AGTGGTCCTG CAACTTTATC CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA 

9600 



GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC AACGTTGTTG CCATTGCTGC AGGCATCGTG 

9660 



GTGTCACGCT CGTCGTTTGG TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA 

9720 



GTTACATGAT CCCCCATGTT GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT 

9780 



GTCAGAAGTA AGTTGGCCGC AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT 

9840 



CTTACTGTCA TGCCATCCGT AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA 

9900 



TTCTGAGAAT AGTGTATGCG GCGACCGAGT TGCTCTTGCC CGGCGTCAAC ACGGGATAAT 

9960 



ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA 

10020 



AAACTCTCAA GGATCTTACC GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC 

10080 

AACTGATCTT CAGCATCTTT TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG 

10140 



CAAAATGCCG CAAAAAAGGG AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC 

10200
FIGURE 2K



CTTTTTCAAT ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT 

10260 



GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA 

10320 



CCTGACGTCT AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG 

10380 



AGGCCCTTTC GTCTTCAA 
10398 




FIG. 3

FIGURE 5A

TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA

60

CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG

120

TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA CTGAGAGTGC

180

ACCATATGCG GTGTGAAATA CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGGCGCC

240

ATTCGCCATT CAGGCTGCGC AACTGTTGGG AAGGGCGATC GGTGCGGGCC TCTTCGCTAT

300

TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGATT AAGTTGGGTA ACGCCAGGGT

360

TTTCCCAGTC ACGACGTTGT AAAACGACGG CCAGTGCCAA GCTTGCATGC CTGCAGGTCG

420

ACTCTAGAGG ATCCGAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA

480

TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG

540

CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA

600

ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC GCCAAATCAT TTATTGTTCA

660

AAGATGCAGT CATCCAAATC CACATTGACC AGATCGCAGG CAGTGCAAGC GTCTGGCACC

720

TTTCCCATGA TATGATGAAT GTAGCACAGT TTCTGATACG CCTTTTTGAC GACAGAAACG

780

GGTTGAGATT CTGACACGGG AAAGCACTCT AAACAGTCTT TCTGTCCGTG AGTGAAGCAG

840

ATATTTGAAT TCTGATTCAT TCTCTCGCAT TGTCTGCAGG GAAACAGCAT CAGATTCATG

900

CCCACGTGAC GAGAACATTT GTTTTGGTAC CTGTCTGCGT AGTTGATCGA AGCTTCCGCG

960

TCTGACGTCG ATGGCTGCGC AACTGACTCG CGCACCCGTT TGGGCTCACT TATATCTGCG

1020

FIGURE 5B

TCACTGGGGG CGGGTCTTTT CTTGGCTCCA CCCTTTTTGA CGTAGAATTC ATGCTCCACC

1080

TCAACCACGT GATCCTTTGC CCACCGGAAA AAGTCTTTGA CTTCCTGCTT GGTGACCTTC

1140

CCAAAGTCAT GATCCAGACG GCGGGTGAGT TCAAATTTGA ACATCCGGTC TTGCAACGGC

1200

TGCTGGTGTT CGAAGGTCGT TGAGTTCCCG TCAATCACGG CGCACATGTT GGTGTTGGAG

1260

GTGACGATCA CGGGAGTCGG GTCTATCTGG GCCGAGGACT TGCATTTCTG GTCCACGCGC

1320

ACCTTGCTTC CTCCGAGAAT GGCTTTGGCC GACTCCACGA CCTTGGCGGT CATCTTCCCC

1380

TCCTCCCACC AGATCACCAT CTTGTCGACA CAGTCGTTGA AGGGAAAGTT CTCATTGGTC

1440

CAGTTTACGC ACCCGTAGAA GGGCACAGTG TGGGCTATGG CCTCCGCGAT GTTGGTCTTC

1500

CCGGTAGTTG CAGGCCCAAA CAGCCAGATG GTGTTCCTCT TGCCGAACTT TTTCGTGGCC

1560

CATCCCAGAA AGACGGAAGC CGCATATTGG GGATCGTACC CGTTTAGTTC CAAAATTTTA

1620

TAAATCCGAT TGCTGGAAAT GTCCTCCACG GGCTGCTGGC CCACCAGGTA GTCGGGGGCG

1680

GTTTTAGTCA GGCTCATAAT CTTTCCCGCA TTGTCCAAGG CAGCCTTGAT TTGGGACCGC

1740

GAGTTGGAGG CCGCATTGAA GGAGATGTAT GAGGCCTGGT CCTCCTGGAT CCACTGCTTC

1800

TCCGAGGTAA TCCCCTTGTC CACGAGCCAC CCGACCAGCT CCATGTACCT GGCTGAAGTT

1860

TTTGATCTGA TCACCGGCGC ATCAGAATTG GGATTCTGAT TCTCTTTGTT CTGCTCCTGC

1920

GTCTGCGACA CGTGCGTCAG ATGCTGCGCC ACCAACCGTT TACGCTCCGT GAGATTCAAA

1980

CAGGCGCTTA AATACTGTTC CATATTAGTC CACGCCCACT GGAGCTCAGG CTGGGTTTTG

2040
FIGURE 5C



GGGAGCAAGT AATTGGGGAT GTAGCACTCA TCCACCACCT TGTTCCCGCC TCCGGCGCCA 

2100 



TTTCTGGTCT TTGTGACCGC GAACCAGTTT GGCAAAGTCG GCTCGATCCC GCGGTAAATT 

2160 

CTCTGAATCA GTTTTTCGCG AATCTGACTC AGGAAACGTC CCAAAACCAT GGATTTCACC 

2220 



CCGGTGGTTT CCACGAGCAC GTGCATGTGG AAGTAGCTCT CTCCCTTCTC AAATTGCACA 

2280 



AAGAAAAGGG CCTCCGGGGC CTTACTCACA CGGCGCCATT CCGTCAGAAA GTCGCGCTGC 

2340 



AGCTTCTCGG CCACGGTCAG GGGTGCCTGC TCAATCAGAT TCAGATCCAT GTCAGAATCT 

2400 



GGCGGCAACT CCCATTCCTT CTCGGCCACC CAGTTCACAA AGCTGTCAGA AATGCCGGGC 

2460 



AGATGCCCGT CAAGGTCGCT GGGGACCTTA ATCACAATCT CGTAAAACCC CGGCATGGCG 

2520 

GCTGCGCGTT CAAACCTCCC GCTTCAAAAT GGAGACCCTG CGTGCTCACT CGGGCTTAAA 

2580 



TACCCAGCGT GACCACATGG TGTCGCAAAA TGTCGCAAAA CACTCACGTG ACCTCTAATA 

2640 



CAGGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTCG TAATCATGGT CATAGCTGTT 

2700 



TCCTGTGTGA AATTGTTATC CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA 

2760 

GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT 

2820 



GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC 

2880 



GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG 

2940 



CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC 

3000 

CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 

3060

FIGURE 5D

GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA

3120

TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA

3180

GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG

3240

ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG

3300

GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT

3360

TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA

3420

CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG

3480

CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT

3540

TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC

3600

CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG

3660

CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG

3720

GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA

3780

GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG

3840

GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG

3900

TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC

3960

ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC

4020

AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC

4080

FIGURE 5E

CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG

4140

TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT

4200

GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG

4260

CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT

4320

GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG

4380

ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG

4440

ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT

4500

AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT

4560

GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC

4620

TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT

4680

AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT

4740

TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA

4800

AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GACGTCTAAG AAACCATTAT

4860

TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG CCCTTTCGTC
4910